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About This Manual 


Preface 


Read This First 


The purpose of this book is to present integrated solutions for typical 
TMS320C54x design issues. It combines a description of specific 
programming topics with code examples. The text discusses basic 
programming topics for the 54x digital signal processor (DSP). The code 
examples were created and tested in real time using the 54x evaluation 
module (EVM) as a platform. You may use these examples in developing your 
applications. 


How to Use This Manual 


The book is divided into two parts: topic information, provided in Chapters 1-9, 
and complete code examples, provided in Chapter 10. 


(j Topic information, Chapters 1—9: These chapters give you a framework of 
knowledge for programming the 54x. Before creating code, beginners 
may want to read these chapters entirely to understand why these issues 
must be addressed in certain ways. Advanced users may want to read 
only the topics relevant to specific code applications. 


(1 Complete code examples, Chapter 10: These examples elaborate on the 
code provided in Chapters 1-9. This code has been tested and can be run 
as is. 


Notational Conventions 
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This document uses the following conventions. 


(j Program listings and program examples are shown in a special font. 


Here is a sample program listing: 


STL A, *AR1+ ; Int_RAM(I)=0 
RSBX INTM ; Globally enable interrupts 
B MAIN_PG ; Return to foreground program 


(1 Throughout this book, the notation ’54x refers to the TMS320C54x and the 
TMS320VC54x. The notations 541, ’542, etc., refer to the TMS320C541, 
TMS320C542, etc. The notation 'LC548 refers to the TMS320LC548. 
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Related Documentation from Texas Instruments 


The following books describe the 54x and related support tools. To obtain a 
copy of any of these Tl documents, call the Texas Instruments Literature 
Response Center at (800) 477-8924. When ordering, please identify the book 
by its title and literature number. 


TMS320C54x DSP Reference Set (literature number SPRU210) is 
composed of four volumes of information, each with its own literature 
number for individual ordering. 


TMS320C54x DSP Reference Set, Volume 1: CPU and Peripherals 
(literature number SPRU131) describes the TMS320C54x 16-bit, 
fixed-point, general-purpose digital signal processors. Covered are its 
architecture, internal register structure, data and program addressing, 
the instruction pipeline, DMA, and on-chip peripherals. Also includes 
development support information, parts lists, and design considerations 
for using the XDS510 emulator. 


TMS320C54x DSP Reference Set, Volume 2: Mnemonic Instruction Set 
(literature number SPRU172) describes the TMS320C54x digital signal 
processor mnemonic instructions individually. Also includes a summary 
of instruction set classes and cycles. 


TMS320C54x DSP Reference Set, Volume 3: Algebraic Instruction Set 
(literature number SPRU179) describes the TMS320C54x digital signal 
processor algebraic instructions individually. Also includes a summary 
of instruction set classes and cycles. 


TMS320C54x DSP Reference Set, Volume 4: Applications Guide 
(literature number SPRU173) describes software and hardware 
applications for the TMS320C54x digital signal processor. Also includes 
development support information, parts lists, and design considerations 
for using the XDS510 emulator. 


TMS320C54x, TMS320LC54x, TMS320VC54x Fixed-Point Digital Signal 
Processors (literature number SPRSO039) data sheet contains the 
electrical and timing specifications for these devices, as well as signal 
descriptions and pinouts for all of the available packages. 


TMS320C54x Assembly Language Tools User’s Guide (literature number 
SPRU102) describes the assembly language tools (assembler, linker, 
and other tools used to develop assembly language code), assembler 
directives, macros, common object file format, and symbolic debugging 
directives for the ’C54x generation of devices. 
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Related Documentation from Texas Instruments 


TMS320C5xx C Source Debugger User’s Guide (literature number 
SPRUO099) tells you how to invoke the ’C54x emulator, EVM, and 
simulator versions of the C source debugger interface. This book 
discusses various aspects of the debugger interface, including window 
management, command entry, code execution, data management, and 
breakpoints. It also includes a tutorial that introduces basic debugger 
functionality. 


TMS320C54x Code Generation Tools Getting Started Guide (literature 
number SPRU147) describes how to install the TMS320C54x assembly 
language tools and the C compiler for the ’C54x devices. The installation 
for MS-DOS™, OS/2™, SunOS™, Solaris™, and HP-UX™ 9.0x systems 
is covered. 


TMS320C54x Evaluation Module Technical Reference (literature number 
SPRU135) describes the ’C54x EVM, its features, design details and 
external interfaces. 


TMS320C54x Optimizing C Compiler User’s Guide (literature number 
SPRU103) describes the ’C54x C compiler. This C compiler accepts 
ANSI standard C source code and produces TMS320 assembly 
language source code for the ’C54x generation of devices. 


TMS320C54x Simulator Getting Started (literature number SPRU137) 
describes how to install the TMS320C54x simulator and the C source 
debugger for the ‘C54x. The installation for MS-DOS™, PC-DOS™, 
SunOS™, Solaris™, and HP-UX™ systems is covered. 


TMS320 Third-Party Support Reference Guide (literature number 
SPRU052) alphabetically lists over 100 third parties that provide various 
products that serve the family of ’320 digital signal processors. A myriad 
of products and applications are offered—software and hardware 
development tools, speech recognition, image processing, noise 
cancellation, modems, etc. 


TMS320 DSP Development Support Reference Guide (literature number 
SPRUO011) describes the TMS320 family of digital signal processors and 
the tools that support these devices. Included are code-generation tools 
(compilers, assemblers, linkers, etc.) and system integration and debug 
tools (simulators, emulators, evaluation modules, etc.). Also covered are 
available documentation, seminars, the university program, and factory 
repair and exchange. 
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Trademarks 


Trademarks 


vi 
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Borland is a trademark of Borland International, Inc. 
HP-UX is a trademark of Hewlett-Packard Company. 
MS-DOS is a registered trademark of Microsoft Corporation. 


OS/2, PC/AT, and PC-DOS are trademarks of International Business 
Machines Corporation. 


PAL® is a registered trademark of Advanced Micro Devices, Inc. 
Solaris and SunOS are trademarks of Sun Microsystems, Inc. 


SPARC is a trademark of SPARC International, Inc., but licensed exclusively 
to Sun Microsystems, Inc. 


Windows is a registered trademark of Microsoft Corporation. 


320 Hotline Online, Tl, XDS510, and XDS510WS are trademarks of Texas 
Instruments Incorporated. 
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PRELIMINARY If You Need Assistance 


If You Need Assistance. . . 
— World-Wide Web Sites 


TI Online http://www.ti.com 

Semiconductor Product Information Center (PIC) —_http://www.ti.com/sc/docs/pic/home.htm 
DSP Solutions http://www.ti.com/dsps 

320 Hotline On-line™ http://www.ti.com/sc/docs/dsps/support.html 


North America, South America, Central America 


Product Information Center (PIC) (972) 644-5580 
TI Literature Response Center U.S.A. (800) 477-8924 
Software Registration/Upgrades (214) 638-0333 Fax: (214) 638-7742 

U.S.A. Factory Repair/Hardware Upgrades (281) 274-2285 

U.S. Technical Training Organization (972) 644-5580 

DSP Hotline (281) 274-2320 Fax: (281) 274-2324 Email: dsph@ti.com 
DSP Modem BBS (281) 274-2323 

DSP Internet BBS via anonymous ftp to ftp://ftp.ti.com/mirrors/tms320bbs 


Europe, Middle East, Africa 
European Product Information Center (EPIC) Hotlines: 
Multi-Language Support +33 13070 11 69 :+33 130701032 Email: epic@ti.com 
Deutsch +49 8161 80 33 11 or +33 1 30 70 11 68 
English +33 1 30 70 11 65 
Francais +33 13070 11 64 
Italiano +33 1 30 70 11 67 
EPIC Modem BBS +33 1 30 70 11 99 
European Factory Repair +33 4 93 22 25 40 
Europe Customer Training Helpline : +49 81 61 80 40 10 


Asia-Pacific 

Literature Response Center +852 2 956 7288 : +852 2 956 2200 
Hong Kong DSP Hotline +852 2 956 7268 : +852 2 956 1002 
Korea DSP Hotline +82 2551 2804 : +82 2 551 2828 
Korea DSP Modem BBS +82 2551 2914 

Singapore DSP Hotline : +65 390 7179 
Taiwan DSP Hotline +886 2 377 1450 : +886 2 377 2718 
Taiwan DSP Modem BBS +886 2 376 2592 


Japan 

Product Information Center +0120-81-0026 (in Japan) : +0120-81-0036 (in Japan) 
+03-3457-0972 or (INTL) 813-3457-0972 : +03-3457-1259 or (INTL) 813-3457-1259 

DSP Hotline +03-3769-8735 or (INTL) 813-3769-8735 : +03-3457-7071 or (INTL) 813-3457-7071 

DSP BBS via Nifty-Serve Type “Go TIASP” 


Documentation 


When making suggestions or reporting errors in documentation, please include the following information that is on the title 
page: the full title of the book, the publication date, and the literature number. 
Mail: Texas Instruments Incorporated Email: comments@books.sc.ti.com 
Technical Documentation Services, MS 702 
P.O. Box 1443 
Houston, Texas 77251-1443 


Note: When calling a Literature Response Center to order documentation, please specify the literature number of the 
book. 
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Contents 


A IMEFOGUCTION x40 5c pecan daedw beccat ade yatew taeda te waa taeea aera pierre ft-1] 
Presents an introduction to DSP systems, which consist of a DSP, optional external memory, 
and an analog front end. Describes the architecture of a typical DSP and an EVM. 


1.1 DSP OVCIVIOW ccc cc cis hae ehh dade da ket aasee aad eadassGducnitacne daanana es 


1.2  °54x Evaluation Module (EVM) Overview ............0. 00 e eens 
13. ‘Memory Interate::..cctceteratodetebeeetadere deeeh dese hdeeatesieb ed ene yen 
1.4 ’54x EVM External Memory Interface ............2. 0.0 cece eee eee 
2. ‘System :Stant-Up' 25 v0ccctcewendee cede neddeepialinedee belie aniline weedeceeties 


Presents typical options when using on-chip ROM, external 16-bit memory (EPROM), or 
bootloading from an 8-bit EPROM. Describes start-up conditions and initialization. 


2.1. On-Chip ROM/External 16-Bit EPROM ............0 0.0: c cece nee e cence eens D-2) 
2.2 Processor Initialization ev.cicexcopennes ove Whee ds Cheek meee ieee ak eas 2-3 
ol (c8 | 2 ee en eee eter errr eet eer err een eer: ere ee B-1| 


Describes the most common means by which data goes to and from the DSP. Discusses 
initialization of serial ports and the process of acquiring and transmitting data using interrupt 
service routines (ISRs). 

3.1. Synchronous Serial Port Devices .......... 00.0 teen eae 
3.2 TLC320AC01 Analog Interface Circuit 
B38 SOMWALE SACK «0s ec ceee de narnaie eteia aed pated ele ate da hereeauene aus eee eae 


3.4 Context Switching ........... 0. teen enna 
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3:6. (Interrupt Pronty eet cee ad eevee eee di ee hed Serra ey bees 
3./ Circular AddreSSiINg 2.0.0 ced yen coos cade eee na ee eee babe Pee eee Ga 
3.8 Buffered Serial Port .......... 0... eet ene eens 
4 ‘Signal Processing) «ciece cece sce eee eee eee see eee eee eee eens 


Discusses digital filters that use both fixed and adaptive coefficients and real fast Fourier 
transforms. 


4.1 Finite Impulse Response (FIR) Filters .......... 0.00: cece eee 
4.2 Infinite Impulse Response (IIR) Filters .... 2.0... 0. eee eee 
43 Adaptive Filtering .............0. 2.0.2 ce eee nes 
4.4 Fast Fourier Transforms (FFTS) .......... 00000 ee cece cece e eee eee e eens 
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> ‘IResource Managemen 2.260. 6ic.cscancetatescadeocaddecddnevaunsa cen evaenewauins 5-1] 
Discusses the on- versus off-chip memory and how to handle requirements for more than 64K 
words of memory. 


5:1. Memory Allocation sacs.ceeredserersetats eyed seredae tease era vbeteseeterberaeda 
5.2 Overlay Management ............ 0c cece eens 
5.3 Memory-to-Memory Moves .......... 00: cece nett nets 
5.4 PowerManagement ................0 ccc eee eens 
6 Arithmetic and Logical Operations ..............0 cece eee eee eee 


Takes a look at both single- and extended-precision operations, for both fixed-point and 
floating-point examples. Discusses methods of bit manipulation and packing/unpacking data. 


6.1. Division and Modulus Algorithm 


6.2 Sines and Cosines ............ 0.0 e eens 
6.3: - ‘SQUare ROO .csinct dha noi deta vanes ntti tat eed eddy eae eee ee 
6.4 Extended-Precision Arithmetic ............ 0. ccc teens 
6.4.1. Addition and Subtraction 
6:42 ‘Multiplication s.i.4-202 .0¢- eet aircon ed heel oon beed obs dein iw aie 
6.5 Floating-Point Arithmetic .......... 0.0... c ccc teens 
6:6 Logical Operations  c.-22 cide uckaeed nade pike ee Pate ee Oded ee 
7 Application-Specific Examples ............0.0 cece cece eee eee eee 


Describes applications using codebook search for speech coding and Viterbi decoding for 
telecommunications. 


7.1. Codebook Search for Excitation Signal in Speech Coding ...............0-0000 eee 
7.2 Viterbi Algorithm for Channel Decoding ........... 0... cece eee 
8 Bootloader: 2cicc cece eceeseeees et es eee ee ete es eiwies ieee eee permed dese eee wee es B-1] 


Describes the process of loading programs received from the host, EPROMs, or other memory 
devices. Discusses various methods of bootloading and when to use them. 


8.1 Boot Mode Selection ....... 0.0.0. cee ene eee e eee n ene naes 
8.2 Host Port Interface (HPI) Boot Loading Sequence .................0: cee eee eee eee 
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9 Host—Target Communication ............0c cee eens 


Describes the communication process and the registers used in transmission between the '54x 
EVM and its host. 


9.1. Communication Channels 1.1.0.0... enn 9-2 
9.2 Handshake and Data Transfer ......... 2000s cece e eect e eee ee ete eee eee e teeny 9-6 


x PRELIMINARY 


PRELIMINARY Contents 
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Provides code examples from start-up initialization to signal processing developed for real-time 
operation on the ‘54x EVM. 


10.1. Running the Applications ......... 0.0... ccc eee teens 10-2 
10.2 Application Code ........... 0c tenets 0-4 
A_ Design Considerations for Using XDS510 Emulator ............:00cc cece eee eee [a-1] 


Describes the JTAG emulator cable and how to construct a 14-pin connector on your target 
system and how to connect the target system to the emulator. 
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B Development Support and Part Order Information ................:02 eee ee eee eee ees 


Provides device part numbers and support tool ordering information for the TMS320C54x and 
development support information available from TI and third-party vendors. 
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Defines terms and abbreviations used throughout this book. 
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Introduction 


The TMS320C54x is a fixed-point digital signal processor (DSP) in the 
TMS320 family. It provides many options for the design of telecommunication 
and wireless applications. It executes 50 million instructions per second (MIPS), 
providing high performance, low power consumption, and cost effectiveness. 


The code examples in this book were tested in real time using a 54x Evalua- 
tion Module (EVM) platform. This chapter introduces you to DSP and EVM 
architecture and describes the 54x memory interface mechanism and the 
EVM’s interface with memory devices. Since the 54x EVM contains a ’541 
DSP, the chapter discusses the 541 specifically. 
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1.1 DSP Overview 


The block diagram in Figure 1—1 represents a typical DSP system. It uses an 
analog interface, where an input signal is digitized and processed by the DSP 
and has an output terminal. The RAM and EPROM blocks make up the 
system’s memory. These blocks sometimes replace with DSP on-chip 
memory. For a stand-alone system, an EPROM bootloads the code during 
system power-up. The emulation interface can access the ’54x high-level 
language debuggers, factory-installed tests, and field diagnostics. The host 
can download program files or data through the emulation port. The host 
interface provides buffering, host I/O decode, and access control. 


Figure 1-1. Typical DSP System 
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1.2 °54x Evaluation Module (EVM) Overview 


The 54x EVM is a PC/AT™ plug-in half-card that consists of a 541 DSP, 
128K x 16 words of fast SRAM, and a TLC320AC01 analog interface chip 
(AIC). It also includes a programmer interface and a mouse-driven, window- 
oriented C source debugger. It provides a platform for using real-time device 
evaluation, benchmarking, and system debugging to design embedded systems. 


The EVM has all common software debugging features, including breakpoint, 
modify, load, and watch. User-defined windows can be set up to view device 
parameters. The 54x EVM comes with a full assembler/linker package and an 
expansion port for interfacing to peripherals. Other hosts, systems, and target 
boards can communicate with the EVM using the serial port connector, pro- 
vided on the board. The EVM allows you to prototype application software and 
hardware. The example code provided in chapter 10 uses 54x EVM as the 
hardware platform along with a host interface and demonstrates the various 
applications using signal processing techniques. 


Figure 1-2 shows the configuration of the 54x EVM. It interfaces to 128K 
words of zero wait-state static RAM. The EVM includes 64K words of zero wait- 
state program memory and 64K words of zero wait-state data memory. An 
external I/O interface supports 16 parallel I/O ports and a serial port. The host- 
target communication system provides a simple means to pass data between 
the target and host during real-time operation. The two channels, A and B, are 
single, 16-bit bidirectional registers, mapped into two I/O port locations. Chan- 
nel B has a 64-word deep FIFO buffer. The analog interface circuit (AIC) inter- 
faces to the 541 serial port. The codec is a TLC320AC01 AIC that provides 
a 14-bit A/D and D/A interface with 16 bits of dynamic range, and sampling 
rates up to 43.2 KHz. Two RCA connectors provide analog input and output for 
’"ACO1. 


Figure 1-2. Block Diagram of a ‘54x EVM 
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1.3 Memory Interface 


The ’54x has several types of memory interfaces. Each interface contains a 
16-bit address bus and 16-bit data bus signal lines. These transfer information 
and control interface operation. All of the interfaces are independent of one 
another, and different operations can be performed simultaneously on each 
interface. 


The external direct memory access (DMA) interface lets external devices cause 
the processor to give up the bus and control signals for DMA. The MSTRB signal 
is activated for program and memory accesses, and the IOSTRB signal is used 
for transactions with I/O ports (PORTR and PORTW instructions). The R/W sig- 
nal controls the direction of accesses. The external ready input signal (READY) 
and the software-programmable wait-state generators allow the processor to in- 
terface with memory and I/O devices of varying speeds. 


Two signals, HOLD and HOLDA, allow an external device to take control of the 
processor’s buses. The processor acknowledges receiving a HOLD signal 
from an external device by bringing HOLDA low. The RS signal initializes the 
internal ’541 logic and executes the system-initialization software. The PS, 
DS, and I/O select signals are used to select any external program, data, or 
I/O access. The connection of X1 and X2/CLKIN determines the clock source 
that drives the clock generator. The clock mode that operates the clock gener- 
ator is determined by the CLKMD(1-3) signals. 


Figure 1-3 shows the external interfaces on the 541. The device contains two 
independent bidirectional serial ports: serial port 0 and serial port 1. The analog 
interface circuit interfaces directly to the serial ports of the 541. The external flag, 
BIO, is used to monitor the status of the peripheral devices. The XF pin signals 
external devices via software. The 541 has four external, maskable user inter- 
rupts that external devices can use to interrupt the processor and one external, 
nonmaskable interrupt (NMI). These provide a convenient means to perform peri- 
odic I/O or other functions. 


The ’541 can be interfaced with EPROMs and static RAMs. The speed, cost, 
and power limitations imposed by an application determine the selection of 
memory device. If soeed and maximum throughput are important, the 541 can 
run with no wait states. In this case, memory accesses are performed in a 
single machine cycle. Slower memories can be accessed by introducing an 
appropriate number of wait states or slowing down the system clock. 
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Figure 1-3. External Interfaces on the 541 
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*54x EVM External Memory Interface 


The ’54x EVM includes 64K words of zero wait-state program memory and 
64K words of zero wait-state data memory, providing a total of 128K words of 
external memory. The PS line controls accesses to data and program memory. 
Each time the DSP accesses external program memory, the PS line goes low, 
driving the A16 address line low and forcing access to memory locations 
00000h—OFFFFh. Data memory accesses have no effect on the PS line. During 
data memory accesses, the PS line remains high. This means that the EVM 
accesses memory locations from 100000h — 1FFFFh. These accesses are 
tabulated in the truth table below. 


Access MSTRB PS Ds 
Program 0 0 1 
Data 0 1 0 
No access 1 1 1 


Figure 1—4 shows a diagram of program/data memory interfaces for the EVM. 
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Figure 1-4. ’54x EVM Interface to External SRAM for Program and Data Memory 
128K x 8 
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System Start-Up 


To successfully power up a system, you must have a clear understanding of 
reset signals, clock modes and sources, memory interfaces with and without 
wait states, and bank switching. Common methods to start up a program 
include using on-chip ROM, external 16-bit EPROM, and bootloading from an 
8-bit EPROM. This chapter examines these start-up methods and parameters 
and gives examples of initialization software. 
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2.1. On-Chip ROM/External 16-Bit EPROM 


2-2 


The ’54x program memory space can reside either on- or off-chip. On-chip 
memory can include RAM, ROM, and external SRAM; off-chip memory can 
include 16-bit EPROM. There are two common methods to execute a pro- 
gram: running from memory (which uses on-chip ROM or 16-bit external 
EPROM) or using the bootloader either serially or in parallel from a 16- or an 
8-bit EPROM. 


The system can run using a 16-bit external EPROM or on-chip ROM. With on- 
chip ROM, internal memory is enabled. The EVM uses 128K words of external 
memory, which includes 64K words each of program and data memory. If a 
program resides in the external memory space, the program space includes 
part of the 64K of program memory. The EVM has external SRAM that can 
emulate either on-chip ROM or external EPROM. Executing the code from on- 
chip ROM requires no wait states, whereas executing from external EPROM 
requires some wait states, depending upon the speed of the DSP and the 
speed of the EPROM. In either case, the program starts executing from the 
reset vector, FF80h. 


Program memory in on-chip ROM is configured internally, since the processor 
is in microcomputer mode when the MP/MC pin is low. For 16-bit EPROM, the 
program space is external. The processor is configured in microprocessor 
mode when MP/WC is high. 
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At power-up, the state of the ‘54x processor is partially defined, since all the 
bits in both status control registers are established. Thus, the processor is in 
a predefined condition at reset. Some of the conditions in these registers can 
be changed, depending upon the system configuration and the application. 
The reset signal puts the processor in a known state. In a typical DSP system, 
the processor must be initialized for proper execution of an application. 


Starting up the system from on-chip ROM or 16-bit EPROM is done by reset- 
ting the processor. In both cases, the processor is reset by applying a low level 
to the RS pin. Reset causes the processor to fetch the reset vector. Upon reset, 
the IPTR bits of the PMST register are all set to 1, mapping the vectors to page 
511 in program memory space. This means that the reset vector always re- 
sides at program memory location OFF80h. This location usually contains a 
branch instruction to direct program execution to the system initialization rou- 
tine. 


After a reset, the processor initializes the following internal functions: 


J] Stack pointer (SP) 

J Memory-mapped peripheral control registers (SWWSR and BSCR) 
1 Status registers (STO and ST1) 

J Control register (PMST) 


Some of the reset bits in the status and control registers can be changed during 
initialization. The ‘54x has a software stack and the stack pointer must be 
initialized. The data page pointer, which is initialized to page O during reset, 
must also be configured during initialization. The predefined and the remaining 
bits in STO, ST1, and PMST are initialized so that the processor starts execut- 
ing the code from a defined state. 


Software wait-state generators interface external memory devices to the pro- 
cessor. Programmable bank switching determines the bank size. The software 
wait-state register (SWWSR) and the bank-switching control register (BCSR) 
are initialized. 


Example 2-1 initializes the vector table. Example 2—2, on page 2-6, initializes 
the ’54x, where the processor branches to main_start upon reset. 
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Example 2-1. Vector Table Initialization — vectors.asm 


; TEXAS INSTRUMENTS INCORPORATED 


-mmregs 
- include “init_54x.inc” 

- include "main.inc” 

.ref SYSTEM _STACK 

.ref main_start 

.ref receive_intl 

.ref host_command_intl 


; Functional Description 


; This function initializes the vector table of 541 device 
j ------------------------------------~--------~------~-~-----~--- Sea ee 
-sect “vectors” 
reset: BD main_start ; RESET vector 
STM #SYSTEM_STACK, SP 
nmi: RETE 
NOP 
NOP 
NOP ; NMI~ 
; software interrupts 
sint17 -space 4*16 
sint18 -space 4*16 
sintl19 -space 4*16 
sint20 -space 4*16 
sint21 -space 4*16 
sint22 -space 4*16 
sint23 -space 4*16 
sint24 -space 4*16 
sint25 -space 4*16 
sint26 -space 4*16 
sint2 -space 4*16 
sint28 -space 4*16 
sint29 -space 4*16 
sint30 -space 4*16 
intO: RETE 
NOP 
NOP ; INTO 
NOP 
intl: BD host_command_intl ; Host interrupt 
PSHM STO 
PSHM STL ; INTL 
int2: RETE 
NOP 
NOP 
NOP 
tint: RETE 
NOP 
NOP ; TIMER 
NOP 
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Example 2-1. Vector Table Initialization — vectors.asm (Continued) 


Frinto: RETE ; Serial Port Receive 
NOP ; Interrupt 0 
NOP 
NOP 
xint0d: RETE ; Serial Port Transmit 
; Interrupt 0 
NOP 
NOP 
NOP 
rintls BD receive_intl ; Serial Port Receive 
PSHM STO ; Interrupt 1 
PSHM ST1 
xintls RETE ; Serial Port Transmit 
NOP ; Interrupt 1 
NOP 
NOP 
int3: RETE 
NOP 
NOP } INT3 
NOP 
end 
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Example 2-2. Processor Initialization — init_54x.asm 


; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
einclude “"init_54x.inc” 
-include "”main.inc” 


.ref d_frame_flag 

.cef RCV_INT1_DP 

.ref aic_init,serial_init,init_54,init_bffr_ptr_var 

.ref task_handler, evm_handshake, fifo_host_transfer 
STACK -usect "stack”,K_STACK_SIZE 
SYSTEM_STACK .set K_STACK_SIZE+STACK 

.def main_start 

.def SYSTEM_STACK 


7 Functional Description 
; This is the main function that calls other functions. 


-sect “main_prg” 
KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKKKKK KK KKK KKK KK 
* The code initializes the 541 device, handshake between Target (DSP) 
* and the host (PC). Zeros all buffers, variables and init. pointers 
* 
* 


Initializes serial port, programs ACOQ1l registers for selecting sampling 
rate, gains etc. 
KKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKKKKKKKKKKKKKKEKKK 


main_start: 


CALL init_54 ; initialize STO,ST1 PMST and 
;other registers 

oe K_HOST_FLAG = 1 

CALL evm_handshake ; EVM host handshake 

endif 

CALL init_bffr_ptr_var ; init tables,vars,bffrs,ptr 

CALL serial_init ; initialize serial_port 1 

CALLD aic_init 7 Configures ACO1 

LD #0,DP 

NOP 


KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK 


* After enabling interrupts from the above, the real processing starts here 
* After collecting 256 samples from ACO1l a flag(d_frame_flag is set). 
* Handles the task initiated by the user and transfers the data to the 


* host. Keeps the sequence for ever !!!!! 
KKK KK KKK KKK KK KKK KK KKK KK KKK KKK KK KKK KKK KEK KKK KK KKK KKK KK KKK KKK KKK KKKKKKKKKKKKEKK 


start_loop 
LD #RCV_INT1_DP, DP ; vestore the DP 
loop: 
BITF d_frame_flag,1 ; if 256 samples are received 
BC loop, NTC ; if not just loop back 
CALL task_handler ; handles task scheduling 
CALL fifo_host_transfer ; EVM HOST interface 
B loop 
end 
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Example 2-2. Processor Initialization — init_54x.asm (Continued) 


* Filename: Main.inc 
* Includes all the constants that are used in the entire application 


K_0O .set 0 ; constant 

K_FIR_INDEX .set L ; index count 

K_FIR_BFFR .set 16 ; FIR buffer size 

K_negl .set =1h ; index count 

K_BIQUAD .set Zz ; there are 2 bi-quad sections 
K_IIR_SIZE .set 10 ; each bi-quad has 5 coeffs 
K_STACK_SIZE .set 200 ; stack size 

K_FRAME SIZE .set 256 ; PING/PONG buffer size 

K FRAME FLAG .set i ; set after 256 collected 

H FILT SIZE .set 128 ; H(z) filter size 

ADPT_FILT_SIZE .set 128 ; W(z) filter size 

K_mu .set Oh ; initial step constant 

K HOST_FLAG .set i ; Enable EVM_HOST interface 

K DEFAULT_ACO1 .set lh ;, Gefault ACO1 init 

* This include file sets the FFT size for the ’54x Real FFT code 

* Note that the Real FFT size (i.e. the number of points in the 

* original real input sequence) is 2N; whereas the FFT size is 

* the number of complex points formed by packing the real inputs, 

* which is N. For example, for a 256-pt Real FFT, K_FFT_SIZE 

* should be set to 128 and K_LOGN should be set to 7. 

K_FFT_SIZE .set 128 ; # of complex points (=N) 

K_LOGN -set T ; # of stages (=logN/1log2) 

K_ZERO_BK .set 0 

K_TWID_TBL_SIZE .set 128 ; Twiddle table size 

K_DATA_IDX_1 .set Z ; Data index for Stage 1 

K_DATA_IDX_2 .set 4 ; Data index for Stage 2 

K_DATA_IDX_3 .set 8 ; Data index for Stage 3 

K_FLY_COUNT_3 .set 4 , Butterfly counter for Stage 3 
K_TWID_IDX_3 .set 32 ; Twiddle index for Stage 3 

KKK KK KKK KK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KKK KKK KKK KKKKEKKKKKKAKKKK KKK KKKK 
* FILENAME: INIT54x.INC 

* This include file contains all the initial values of STO, ST1, PMST, SWWSR, BSCR registers 
7STO Register Organization 

a a a i a a i 
* [15 13 | 12 | 11 | 10 | 9 | 8 0| 

i a a a a a sc a aN a i a a a 
* | ARP | te | c | oval ovB| DP | 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KEK KKK KKK KKK KKK KKK KK KKK 


K_ ARP .set 000b<<13 ; ARP can be addressed from 000b -111b 
; reset value 

K_TC -set 1b<<12 ; TC = 1 at reset 

KC -set ib<<11 ;, C =1 at reset 

K OVA -set 1b<<10 ; OVA = 0 at reset, Set OVA 

K_OVB .set ib<< 9 ; OVB = 0 at reset, Set OVB 

K_DP .set O000000000b<<0; DP is cleared to 0 at reset 

K_STO .set K_ARP|K_TC|K_C|K_OVA|K_OVB|K_DP 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KEK KKK KKK KK KKK KKK KKK 
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Example 2-2. Processor Initialization — init_54x.asm (Continued) 


;ST1 Register Organization 


+ + + F FN 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KK 


K_BRAF -set Ob << 15 ; BRAF = O at reset 
K_CPL -set Ob << 14 ; CPL = 0 at reset 
K_XF -set 1b << 13 ; XF = 1 at reset 
K_HM -set Ob << 12 ; HM = 0 at reset 
K_INTM -set 1b << 11 ; INTM 
K_ST1_RESR -set Ob << 10 ; reserved 
K_OVM -set lb << 9 ; OVM = 0 at reset 
K_SXM -set lb << 8 ; SXM = 1 at reset 
K_C16 -set Ob << O07 ; C16 = 0 at reset 
K_FRCT -set lb << 06 ; FRCT = 0 at reset, 

; Set FRCT 
K_CMPT -set Ob << 05 ; CMPT = O at reset 
K_ASM -set 00000b << 00 ; ASM = 0 at reset 
K_ST1_HIGH .set K_BRAF|K_CPL|K_XF|K_HM|K_INTM|K_ST1_RESR|K_OVM|K_SXMK_ST1_LOW 
.set K_C16|K_FRCT|K_CMPT|K_ASM 
K_ST1 .set  K_ST1_HIGH|K_ST1_Low 


KKKKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK 
rd 


*PMST Register Organization 


+ F F F 


KKK KKK KKK KKK KKK KK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK 


K_IPTR -set 111111111b << 07 ; 111111111b at reset 
K_MP_MC -set ib << 06 ; 1 at reset 

K_OVLY -set Ob << 05 ; OVLY = 0 at reset 
K_AVIS -set Ob << 04 ; AVIS = 0 at reset 
K_DROM -set Ob << 03 ; DROM = 0 at reset 
K_CLKOFF -set 0b << 02 ; CLKOFF = 0 at reset 
K_PMST_RESR -set 00b << 0 ; reserved 


> for 548 bit 0 = SMUL 

;saturation on multiply 

; bit 1 = SST = saturation on store 
K_PMST.set | K_IPTR|K_MP_MC|K_OVLY|K_AVIS|K_DROM|K_CLKOFF|K_PMST_RESR 


KKK KKK KKK KKK KK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK 
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Example 2-2. Processor Initialization — init_54x.asm (Continued) 


*SWWSR Register Organization 


+ + FF 
me 
uo 
R 
ad 
B 
i) 
BR 
R 
ie) 
fee) 
fon 
oO 
W 
iN) 
° 


* 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK 


K_SWWSR_IO .set 2000h ; set the I/O space 


KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK 


*Bank Switching Control Register (BSCR) Organization 


er a a a a a a a a a a Sa ee ee 
* [15 12 | 11 | 10 2 | @ | -o | 
a a a SS 
* | BNKCMP | PS-DS | Reserved | BH | EXIO | 
cr a 
KKEKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKE 
K_BNKCMP -set 0000b << 12 ; bank size = 64K 
K_PS_DS -set Ob << 11 
K_BSCR_RESR -set OO00000000b <<2 ; reserved space 
K_BH -set Ob << 1 i BH = 0 at reset 
K_EXIO set Ob << 0 EXIO = 0 at reset 
K_BSCR .set. K_BNKCMP|K_PS ole BSCR_RESR|K_BH|K_EXIO 
; TEXAS INSTRUMENTS INCORPORATED 

-mmregs 

einclude “init_54x.inc” 

.def init_54 


H Functional Description 
; Initializes the processor from a reset state 


-sect "main_prg” 
init_54: ; Init.the s/w wait state reg.for 2 wait states for I/O operations 
STM #K_SWWSR_IO, SWWSR ; 2 wait states for I/Ooperations 
; wait states for Bank Switch 
STM #K_BSCR, BSCR ; O wait states for BANK SWITCH 


; initialize the status and control registers 
STM #K_STO, STO 
STM #K_ST1, ST1 


STM #K_PMST, PMST 
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Analog I/O 


Most DSP systems transfer data through peripherals. These peripherals 
include parallel and serial ports. This chapter describes how the serial ports 
are initialized and how the TLC320AC01 (’AC01) analog interface circuit (AIC) 
interfaces to the 54x serial port. This chapter also describes the various issues 
involved such as stack, context switching, interrupt priorities, and different 
addressing modes for collecting the samples during the interrupt processing. 
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3.1 Synchronous Serial Port Devices 


Several ’54x devices implement a variety of types of flexible serial port inter- 
faces. These serial port interfaces provide full duplex, bidirectional, commu- 
nication with serial devices such as codecs, serial analog to digital (A/D) con- 
verters, and other serial systems. The serial port interface signals are directly 
compatible with many industry-standard codecs and other serial devices. The 
serial port may also be used for interprocessor communication in multiproces- 
sing applications. When the serial ports are in reset, the device can be config- 
ured to shut off the serial port clocks. This allows the device to run in alow-pow- 
er mode of operation. 


Three signals are necessary to connect the ’54x to the serial port, as shown 
in Figure 3-1. On the transmitting device, the transmit data signal (DX) sends 
the data, the transmit frame synchronization signal (FSX) initiates the transfer 
at the beginning of the packet, and the transmit clock signal (CLKX) clocks the 
bit transfer. 


The corresponding pins on the receiving device are the received serial data 
signal (DR), the receive frame synchronization signal (FSR) and, the receive 
clock signal (CLKR), respectively. At reset, CLKX, CLKR, DR, FSX, and FSR 
become inputs and DX is set for high impedance. Figure 3-1 shows the ’54x 
interface to an ’ACO1 that uses the serial port to transfer the data to and from 
the DSP. The SCLK signal clocks the serial data into and out of the device. 


Figure 3-1. Interfacing a TLC320ACO1C AIC to the ‘54x 


3-2 


10.368 MHz 
TMS320C54x TLC320AC01 


Serial port 
0/1 
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Figure 3—2 shows the master and slave-to-DSP interface. In the master-slave 
mode, the master ’AC01, generates its own shift clock and frame sync signals 
and generates all delayed frame-sync signals to support slave devices. The 
slave receives the shift clock and frame-sync signals from the master device. 


Figure 3-2. Master- and Slave-to-’54x Interfaces 


TMS320C54x 10.368 MHz TLC320AC01 
Serial port Master 
0/1 mode 
@ 
@ 
© 
@ 
TLC320AC01 
Slave 
mode 
o— 
@ 
@ 
@ 
A 
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The serial port operates through three memory-mapped registers: the serial 
port control register (SPC), the data transmit register (DXR), and the data 
receive register (DRR). The configuration of the serial port is changed by writ- 
ing to the SPC register. The system configuration is as follows: 


(1 Word length: The serial port must be programmed to use 16-bit data, since 
communications between the ’ACO1 AIC and the host DSP use 16 bits. 
This means clearing the format bit, FO, to a logic 0. 


(1 Frame synchronization: The system operates in burst mode with frame 
synchronization signals generated externally by the "AC0O1. This means 
the frame synchronization mode bit, FSM, must be set to a logic 1. A frame 
sync pulse is required on the frame sync transmit and receive pins (FSX 
and FSR, respectively) for the transmission and reception of every word. 
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Italso means that the FSX pin has to be configured as an input pin by clear- 
ing the transmit mode bit, TXM, to a logic 0. 


(1 Clock: The clock is generated externally by the "AC0O1 and is supplied to 
the DSP on the CLKX pin. This means clearing the clock mode bit, MCM, 
to a logic 0. 


To reset the serial port, two writes to the SPC register must occur. The first 
must write Os to the transmit reset (XRST) and receive reset (RRST) bits and 
the desired configuration to all the other bits. The second must write 1s to the 
XRST and RRST bits and write the desired configuration to all the other bits. 
The SPC register is written as 0000 0000 1100 1000 to pull the serial port out 
of reset. This is shown in Example 3-1 on page 3-7. 
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3.2 TLC320AC01 Analog Interface Circuit 
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The ’ACO1 is an audio-band processor. It integrates a band-pass switched- 
capacitor antialiasing input filter, 14-bit A/D and D/A converters, a low-pass 
switched-capacitor output reconstruction filter, sinx/x compensation, and a 
serial port for data and control transfers. The A/D and D/A channels operate 
synchronously, and data is transferred in 2s-complement format. It has nine 
internal registers that can be programmed for an application. 


There are three basic modes of operation: 


1) Stand-alone analog interface mode: The ’ACO1 generates the shift clock 
and frame synchronization signals for data transfers; it is the only AIC 
used. 


2) Master-slave mode: One ’AC01 serves as the master, generating the mas- 
ter shift clock and frame synchronization signals; the other ’ACO1s serve 
as slaves. 


3) Linear codec mode: The shift clock and frame synchronization signals are 
externally generated by the host. 


Software control of the AIC allows you to program the sampling frequency, cut- 
off frequencies for the low- and high-pass filters, and analog input and output 
gains at any time. No programming is needed when the default values of the 
’"ACO1 are satisfactory. 


Data transfers between the DSP and the ’AC01 are categorized as primary and 
secondary serial communications. During primary communication, the 14 most 
significant bits (MSBs), bits 2 through 15, specify sample data. If the two least 
significant bits (LSBs), bits 1 and 0, are 11, a subsequent 16 bits of control in- 
formation is received by the ’ACO1 as part of the secondary serial communica- 
tion interval. This control information programs one of the nine internal registers. 
During the secondary communication interval, the bits transmitted to the ’ACO1 
are defined as follows: 


1 Bits 15 and 14, which control phase shifting in certain applications, are 
usually Os. 


Lj Bit 13 decides whether the data is written to or read from a register. 


Lj Bits 12 through 8 define the address of the register to be programmed. 


(1 Bits 7 through 0 contain the data to be stored in a register for write operations. 


The bits received from the ’AC01 during the secondary communication interval 
are Os for bits 15 through 8, and the value of the register data for bits 7 through 
0. 
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The three programs that follow show the use of the ’AC01 in different circum- 
stances: Example 3-1 assumes that the user is satisfied with the default con- 
figuration of the ’ACO1 and proceeds to receive and send data, Example 3-2 
on page 3-12 shows how to program the ‘ACO1 for a particular configuration 
before datatransfer, and Example 3-3 on page 3-17 demonstrates serial com- 
munications while in a master-slave configuration. All three programs consist 
of a main routine which, apart from initializing the serial port and the interrupt, 
also resets the ’ACO1. 


Example 3-1 initiates a reset of the ’ACO1, initializes the serial port, and sets 
up interrupts. To reset the ’ACO1 properly, it must be held in reset for at least 
one MCLK cycle. With the ’ACO1 operating at 10.368 MHz, one MCLK cycle 
equals 96.45 ns. In all the examples, the total time spent between initiating a 
reset and pulling the ’ACO1 out of reset is 12 cycles. For a’54x DSP operating 
at 40 MHz, one cycle is 25 ns. This means that the ’ACO1 spends 300 ns in reset. 
This is roughly three times the specification’s recommended value and is safe, 
since it allows a proper reset. 


If a 50-MHz DSP is used, one cycle equals 20 ns; the same code keeps the 
’"ACO01 in reset for 240 ns. This is also safe, since it is approximately 2.5 times 
the specification’s recommended value. 


The ‘ACO1 can operate at a minimum clock frequency of 5.184 MHz. This im- 
plies that it must be held in reset for at least one MCLK cycle, or 192.9 ns. As- 
suming a processor speed of 50 MHz, the ’AC0O1 spends 12 cycles multiplied 
by 20 ns = 240 ns in reset. 


A safe margin holds the ’ACO1 in reset for at least twice the specification’s rec- 
ommended time. To allow the necessary time in reset, use a dedicated timer 
function or insert useful instructions to fill the time slot. After the ’AC01 is reset, 
the main program sits in a loop, waiting for a serial port interrupt. 


Example 3-1 initializes the serial port and uses the default initialization of the 
"AC01, which is in the master mode and transfers the data upon a receive inter- 
rupt. It is not necessary to initiate a secondary communication interval to pro- 
gram the ’ACO1. The serial port is configured between the cycles that the AIC 
is in reset. 
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Example 3-1. Default Initialization of 'ACO1 


, TEXAS INSTRUMENTS INCORPORATED 


-mmregs 
- include “interrpt.inc” 
-include init_ser.inc” 
AIC_VAR_DP -usect “aic_vars”,0 
aic_in_rst -usect "“aic_vars”,1 
aic_out_of_rst -usect "“aic_vars”,1 
.def serial_init 


, Functional Description 

7 This routine initializes the serial port 1 of 541. The serial port is put 
7 in reset by writing 0’s to RRST and XRST bits and pulled out of reset by 
7 writing 1’s to both RRST and XRST bits. This routine also puts the ACO1 

7 in reset and after 12 cycles the ACO1l is pulled out of reset. The serial 
7 port initialization is done during the 12 cycle latency of the ACO1 init. 


-sect "ser_cnfg” 

serial_init: 
LD #AIC_VAR_DP,DP ; initialize DP for aic_reset 
ST #K_0O, aic_in_rst ; bit 15 = 0 of TCR resets AIC 
PORTW aic_in_rst,K_TRGCR_ADDR ;write to address 14h (TCR) 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


*We need at least 12 cycles to pull the AIC out of reset. 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


STM K_SERIAL_ RST, SPC1 ;reset the serial port with 
70000 0000 0000 1000 

STM #K SERIAL OUT_RST, SPC1 ;bring ser.port out of reset 
70000 0000 1100 1000 

RSBX INTM 

LD 0,DP 

ORM # (K_RINT1|K_INT1),IMR ; Enable RINT1,INT1 
; 0000 0000 0100 0010 

LD #AIC_VAR_DP,DP j; restore DP 

STM (K_RINT1), IFR : clear RINT1 

STM K_0,DXR1 0000 0000 0100 0000 


; Pull the ACO1 out of reset - the ACO1 peguiies that it be held in reset for 
; 1 MCLK, which is equivalent to 96.45ns (based on an MCLK of 10.368MHz) 


ST K_8000, aic_out_of_rst , bit 15 = 1 brings AIC from reset 
RETD 

PORTW aic_out_of_rst, K_TRGCR_ADDR ;AIC out of reset 

.end 


* This include file includes the SPCl register configuration 
KKK KK KK KKK KKK KK KKK KKK KK KKK KKK KK KKK KEK KKK KKK KEK KKK KKK KEK KK KKK KKK KK KKK KKK KKKKEK 


PRELIMINARY Analog I/O 3:7 


TLC320AC01 Analog Interface Circuit PRELIMINARY 


Example 3-1. Default Initialization of ’ACO1 (Continued) 


* FILENAME: “INIT_SER.INC” 
SPC Register Organization 


; 

a a a a a a a a a a a a a a a a a a a i 

* 15 14 | 13 | 12 | 11 | 10 |9 | 8 | 

+ [--=--]-=--- |-------- )--------- |------ |------ |----|----- 

* FREE | SOFT| RSRFULL| XSREMPTY| XRDY | RRDY | IN1| INO | 

ei ea ee ee ee ee ee ee ee ee ee ee ee 

, 

* 7 6 | 5 |4 | 3 | 24/21 | o | 

*  [--=--]-=--- |-----]---- |-----|----]-----]-----| 

» |RRST | XRST| TxmM | McM| FSM | FO | DLB | RES | 

We eae a es ee ee ee a ee ee ee eee = 

* This include file includes the SPCl register configuration 

KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKAKKKKKKKK 

;Bit Name Function 

70 Reserved Always read as 0 

el DLB Digital loop back : 0 -> Disabled, 1 _. Enabled 

72 FO Format bit: O - > data transferred as 8 bit bytes, 1 -> 16 bit 
words 

73. FSM Frame sync pulse: 0 -> serial port in continuous mode, 1 -> FSM 
is required 

74 MCM Clock mode bit: 0 -—> CLKX obtained from CLKX pin 1-> CLKX 
obtained from CLKX 

Hus) TXM Transmit mode bit: 0 -> Frame sync pulses generated externally 
and supplied on FSX pin, 1-> Internally generated frame sync 
pulses out on FSX pin 

76 XRST Transmit reset bit: 0 -> reset the serial port, 1-> bring 
serial port out of reset 

.7 RRST Receive reset bit: 0 -> reset the serial port, 1-> bring 
serial port out of reset 

78 INO Read-only bit reflecting the state of the CLKR pin 

Hee) IN1 Read-only bit reflecting the state of the CLKX pin 

710 RRDY Transition from 0 to 1 indicates data is ready to be read 

pail XRDY Transition from 0 to 1 indicates data is ready to be sent 

e 12. XSREMPTY Transmit shift register empty ( Read-only) 0 -> transistor has 
experienced underflow, 1-> has not experienced underflow 

41:3 RSRFUL Receive shift register full flag (Read-only) : 0 -> Receiv- 
er has experienced overrun, 1-> receiver has not experienced 
overrun 

714 SOFT Soft bit - 0 -> immediate stop, 1-> stop after word completion 

715 FREE Free run bit: 0 -> behavior depends on SOFT bit, 1-> free run 


regardless of SOFT bit 
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Example 3-—1.Default Initialization of ‘ACO1 (Continued) 


, 


The system has the following configuration: 
Uses 16-bit data => FO = 0 
Operates in burst mode => FSM = 1 
CLKX is derived from CLKX pin => MCM = 0 
Frame sync pulses are generated externally by the AIC => TXM = 0 
Therefore, to reset the serial port, the SPC field would have 
0000 0000 0000 1000 
To pull the serial port out of reset, the SPC field would have 
0000 0000 1100 1000 


K_O .set O0O0000000b << 8 ; bits 15-8 to 0 at reset 
K_RRST .set 0b << 7 ; First write to SPC1 is 0 
; second write is 1 
K_XRST .set Ob << 6 ; First write to SPC1 is 0 
, second write is 1 
K_TXM .set Ob << 5 
K_MCM .set Ob << 4 
K_FSM -set Ib << 3 ; Frame Sync mode 
K_ZERO .set Q00b << 0 
K_SERIAL_RST .set K_0|K_RRST|K_XRST|K_TXM|K_MCM|K_FSM|K_ZERO 
, first write to SPCl register 
K_ RRST1 -set ib << 7 ; second write to SPC1l 
K_XRST1 -set ib << 6 ; second write to SPC1l 
K_SERIAL_OUT_RST .set K_0|K_RRST1|K_XRST1|K_TXM|K_MCM|K_FSM|K_ZERO 
K_TRGCR_ADDR .set 14h ; Target/Status I/O address 
K_O -set Oh 
K_8000 .set 8000h , set bit 15 to pull AIC out 
; of reset 
* FILENAME: INTERRUPT.INC 


* 


541 Interrupt Mask Register (IMR) Organization 


This file includes the IMR and IFR configuration 


KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK 


K_ 
K_ 
K_ 
K_ 
K__ 
K__ 
K_ 
K_ 
K_ 
K_ 


IMR_RESR -set 0000000b << 9 ; reserved space 

INT3 -set ib << 8 ; Gisable INT3 

XINT1 -set ip << 7 ; Gisable transmit interrupt 1 
RINT1 -set Ib << 6 ; enable receive interrupt 1 
XINTO .set ib <<. 5 ; Gisable transmit interrupt 0 
RINTO -set ib << 4 ; Gisable receive interrupt 
TINT .set Ib << 3 ; Gisable timer interrupt 

INT2 .set Ib << 2 ; Gisable INT2 

INTL .set ib << 1 ; Gisable INT1 

INTO -set ib << 1 ; enable INTO 
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Example 3-2 on page 3-12 demonstrates initiation of a secondary commu- 
nication interval for programming the ‘ACO1. The configuration of the "ACO1 
can be changed via its nine internal registers. To understand this, consider reg- 
isters 1 and 2 in the ’ACO1. 


The data content of register 1, also referred to as the register A, determines 
the divisions of the master clock frequency that produce the internal filter clock, 
FCLK. The default value of this register is 18 (decimal). The filter clock 
frequency is calculated using the equation: 


FCLK frequency = master clock frequency / (register A contents x 2) 


This means that for an MCLK of 10.368 MHz, the default value of the filter clock 
frequency is 


FCLK = 288 kHz 


This FCLK determines the —3 dB corner frequency of the low-pass filter which 
is given by: 


fp = FCLK/40 
The default value of fp is equal to 7.2 kHz. 


The data content of register 2 of the ’AC01, also referred to as register B, deter- 
mines the division of FCLK to generate the conversion clock, and is given by 
the equation: 


Conversion frequency = FCLK/(register B contents) 


The default value of register B is equal to 18 (decimal). Hence, the default value 
of the conversion (sampling) frequency is equal to: 


288 kHz /18 = 16 kHz 


This register also determines the —3 dB corner frequency of the high-pass 
filter, which is given by: 


fup = sampling frequency/200 
Hence, the default value of fyp is equal to 80 Hz. 


For a system that processes speech signals, the following parameters are 
desirable: 


Sampling frequency fs = 8.0 kHz 


Low-pass filter corner frequency, f_p = 3.6 kHz 
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Assume that the ’ACO1 uses a master clock frequency, MCLK, of 10.368 MHz. 
The ‘ACO1’s parameters are: 


f_p = FCLK/40 gives FCLK = 144 kHz 
FCLK = MCLK/(register A x 2) gives register A = 36 (decimal) 
fs = FCLK/B gives register B = 18 (decimal) 


This also means that with the specified sampling frequency, the —3 dB corner 
frequency of the high-pass filter changes to 8.0 kHz / 200 = 48 Hz. 


After the ’ACO1 has been reset in the main program, it makes a call to the rou- 
tine wrt_cnfg, which programs the new values of registers A and B as calcu- 
lated above. To prevent the occurrence of an interrupt while programming the 
’"AC01, wrt_cnfg disables all interrupts until the end of programming. After 
each word has been sent to the serial port, the code waits for the data to be 
copied from the data transmit register to the transmit shift register before it 
sends the next data. After the ’ACO1 has been programmed, the main routine 
waits for an interrupt. The service routines that transfer data between the 
memory buffers and the serial port transmit and receive registers remain the 
same. 
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Example 3-2. Initiation of a Secondary Communication Interval for Programming 


the ‘ACO1 
; TEXAS INSTRUMENTS INCORPORATED 

-mmregs 

- include “interrpt.inc” 

- include “init_ser.inc” 
AIC_VAR_DP -usect “aic_vars”,0 
aic_in_rst -usect “aic_vars”,1 
aic_out_of_rst -usect “aic_vars”,1 


.def serial_init 
; Functional Description 
; This routine initializes the serial port 1 of 541. The serial port is put 
; in reset by writing 0’s to RRST and XRST bits and pulled out of reset by 
7 writing 1’s to both RRST and XRST bits. This routine also puts the ACO1 
7 in reset and after 12 cycles the ACO1 is pulled out of reset. The serial 
; port initialization is done during the 12 cycle latency of the ACO1 init. 


-sect “ser_cnfg” 
serial_init: 


LD #AIC_VAR_DP, DP ; initialize DP for aic_reset 
ST #K_0O, aic_in_rst ; bit 15 = 0 of TCR resets AIC 
PORTW aic_in_rst,K_TRGCR_ADDR ;write to address 14h (TCR) 


KKK KKK KKK KKK KKK KKK KKK KK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


*We need at least 12 cycles to pull the AIC out of reset. 


KKKKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KK 


STM #K_SERIAL RST, SPC1 ;reset the serial port with 
70000 0000 0000 1000 
ST™ #K_SERIAL _OUT_RST, SPC1 ;bring ser.port out of reset 


70000 0000 1100 1000 


RSBX INTM 
LD #0,DP 
ORM # (K_RINT1|K_INT1),IMR ; Enable RINT1,INT1 
; 0000 0000 0100 0010 
LD #AIC_VAR_DP,DP ; restore DP 
STM # (K_RINT1),IFR ; clear RINT1 


STM #K_0,DXR1 7 0000 0000 0100 0000 
; Pull the ACO1 out of reset - the ACO1 requires that it be held in reset for 
; 1 MCLK, which is equivalent to 96.45ns (based on an MCLK of 10.368MHz) 


ST #K_8000, aic_out_of_rst ; bit 15 = 1 brings AIC from reset 
RETD 
PORTW aic_out_of_rst, K_TRGCR_ADDR ;AIC out of reset 
.end 
; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
- include “interrpt.inc” 
.ref wrt_cnfg , initializes ACO1 


.def aic_init 


, 
; Functional Description 

; This routine disables IMR and clears any pending interrupts before 
7 initializing ACO1. The wrt_cnfg function configures the ACO1 
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Example 3-2. Initiation of a Secondary Communication Interval for Programming 
the ‘ACO1(Continued) 


ale Init: 
CALLD wrt_cnfg ; initialize ACO1 
ANDM # (~K_RINT1|K_INT1),IMR ; Gisable receive_int1,INT1l 
ORM # (K_RINT1|K_INT1),IMR ; enable the RINT1, INT1 
RETD 
STM # (K_RINT1),IFR ; service any pending interrupt 


.end 
KKK KK KKK KK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KEK KK KKK KKK KKKKKKKKKKKAKKKKKKK 


* This file includes the ACO1 registers initialization 

* All registers have 2 control bits that initiates serial communication 

* There are 2 communication modes - primary and secondary communications 

* During primary communication the control bits DOO and DO1 are 11 to request 

* for a secondary communication. In the secondary serial communications the 

* control bits D15 and D14 perform same control function as primary. 

* The R/W~ bit at reset is set to 0 placing the device in write mode. 

KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKK KKK KK KKK 

K_NOP_ADDR .set 0 << 8 

K_REG_0O .set K_ NOP_ADDR 

K_A_ADDR .set 1 << 8 ; REG 1 address 

K_A_REG -set 36 

K_REG_1 ser K_A_ADDR|K_A_REG ; FCLK = 144KHz => A =24h 

K_B_ADDR .set 2 << 8 ; REG 2 address 

K_B_REG .set 18 

K_REG_2 .set K_B_ADDR|K_B_REG ; Sampling rate = 8KHz 

K_AA ADDR .set 3 << 6 ; Register 3 address 

K_AA_ REG .set 0 

K_REG_3 .set K_ AA ADDR|K AA REG ; ; no shift 

K_GAIN_ADDR .set 4 << 8 ; Register 4 address 

K_MONITOR_GAIN .set O0b << 4 ; Monitor output gain = squelch 

K_ANLG_IN_GAIN .set Olb << 2 ; Analog input gain = OdB 

K_ANLG_OUT_GAIN .set Olb << 0 ; Analog output gain = OdB 

K_REG_4 -set K_GAIN_ADDR|K_MONITOR GAIN|K ANLG_IN GAIN|K ANLG_OUT_GAIN 

K_ANLG_CNF_ADDR .set 5 << 8 ; Register 5 address 

K_ ANLG RESRV .set 0 << 3 ; Must be set to OK_HGH_ FILTER 
.set 0 << 2 ; High pass filter is enabled 

K_ENBL_IN .set Olb << 0 ; Enables IN+ and IN- 

K_ REG_5 “see K_ANLG_CNF_ADDR|K_ANLG RESRV|K_HGH_FILTER|K_ENBL_IN 

K DGTL_CNF_ADDR .set 6 << 8 ; Register 6 address 

K ADC_DAC .set QO << 5 ; ADC and DAC is inactive 

K_ FSD_OUT .set 0 << 4 ; Enabled FSD output 

K_16_BIT_ COMM .set 0 << 3 ; Normal 16-bit mode 

K_ SECND_COMM .set 0 << 2 ; Normal secondary communication 

K SOFT_RESET .set Q << 1 ; Inactive reset 

K_POWER_DWN .set 0 <= 0 ; Power down external 

K_REG_HIGH_6 .set| K_DGTL_CNF_ADDR|K_ADC_DAC|K_FSD_OUT|K_16_BIT_COMM 

K_REG_LOW_6 .set| K_SECND_COMM|K_SOFT_RESET|K_POWER_DWN 

K_ REG_6 asec K_REG_HIGH 6|K REG_LOW_6 

K_FRME_SYN_ADDR .set 7 << 6 ; Register 7 address 

K_FRME_SYN .set O18 ; 

K_ REG_7 Ser K_FRME_SYN ADDR|K FRME_SYN 

K_FRME_NUM_ADDR .set 8 << 8 ; Register 8 address 

K_FRME_NUM -set 0 << 8 7 
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Example 3-2. Initiation of a Secondary Communication Interval for Programming 
the ‘AC01(Continued) 


K_REG_8 .set K_FRME_NUM ADDR|K FRME_NUM 
; primary word with DO1 and DOO bits set to 11 will cause a 
7 secondary communications interval to start when the frame 
7 sync goes low next 
K_SCND_CONTRL.set llb << 0 ; Secondary comm.bits 
AIC_REG_START_LIST .sect "aic_reg” ; includes the aic table 
-word AIC_REG_END_LIST-AIC_REG_START_LIST-1 
-word K_REG_1 
-word K_REG_2 
-word K_REG_3 
-word K_REG 4 
-word K_REG_5 
-word K_REG_6 
-word K_REG_7 
-word K_REG_8 
AIC_REG_END_LIST 
K_XRDY .set 0800h ; XRDY bit in SPC1 
; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
-include “interrpt.inc” 
-ref wrt_cnfg ; initializes ACO1 
.def aic_init 


; Functional Description 
i This routine disables IMR and clears any pending interrupts before 
; initializing ACO1. The wrt_cnfg function configures the ACO1 
-sect “aic_cnfg” 
aic_init: 


CALLD wrt_cnfg ; initialize ACO1 
ANDM # (~K_RINT1),IMR ; Gisable receive_intl 
ORM # (K_RINT1|K_INT1),IMR ; enable the RINT1, INT1 
RETD 
STM # (K_RINT1),IFR ; service any pending interrupt 
.end 
; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 


-include "aic_cfg.inc” 
aic_reg_tble .usect "aic_vars”,10 
.def wrt_cnfg 


7 Functional Description 


: Writes new configuration data into the ACO1. Assuming a system 

H which processes speech signals and * requires the following parameters 
: Low pass filter cut-off frequency = 3.6 kHz 

; Sampling rate = 8000 Hz 

7 Assume the Master clock MCLK = 10.368 MHz 

; This example demonstrates how to program these parameters - 

; the registers affected are: 

; Register A which determines the division of the MCLK frequency 

7 to generate the internal filter clock FCLK. 

; It also determines the -3 dB corner frequency of the low-pass filter 
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Example 3-2. Initiation of a Secondary Communication Interval for Programming 
the ‘AC01(Continued) 


7 Register B which determines the division of FCLK to generate 
7 the sampling (conversion) frequency 
; It also determines the -3dB corner frequency of the high-pass filter 


-asg AR1,AIC_REG_P 
-sect “aic_cnfg” 


STM #aic_reg_tble,AIC_REG_P yj init AR1 

RPT #AIC_REG_END_LIST-AIC_REG_START_LIST 

MVPD #AIC_REG_START_LIST, *AIC_REG_Pt+t+ ; move the table 

STM #aic_reg_tble,AIC_REG_P ; init AR1 

STM #K_REG_0O,DXR1 ; primary data word - 
7 a jump start! 


wait_xrdy 


BITF SPC1,K_XRDY ;test XRDY bit in SPCl 
BC wait_xrdy,NTC ;loop if not set 
STM #K_SCND_CONTRL, DXR1 ;send primary word with 

; DO1-DOO = 11 to 

;Ssignify secondary communication 
LD *AIC_REG_P+,A 
STLM A,BRC ; gives the # of registers to be 
NOP 


; initialized 
RPTB aic_cfg_complte-1 
wait_xrdyl 


BITF SPC1,K_XRDY ;test XRDY bit in SPC1 
BC wait_xrdy1,NTC ;loop if not set 
LD *ATC_REG P+,A ; Read the register contents 
STLM A, DXR1 
wait_xrdy2 
BITF SPC1,K_XRDY ;test XRDY bit in SPC1 
BC wait_xrdy2,NTC ;loop if not set 
STM #K_SCND_CONTRL, DXR1 ; set to read the next register 
aic_cfg_complte 7 contents 
wait_xrdy3 
BITF SPC1,K_XRDY ;test XRDY bit in SPC1 
BC wait_xrdy3,NTC ;loop if not set 
RET 
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Example 3-3 on page 3-17 uses two ’AC01sin master-slave configuration. Af- 
ter the ’ACO1s have been reset, the AcO1s make a call to the routine wrt_cnfg. 
This routine programs the low- and high-pass filter cutoff frequencies, the 
number of slave devices attached, and sets up the frame-sync delay register. 
During the programming mode, the master and its slaves are programmed 
with the same control information. The last register programmed is frame-sync 
delay, which determines the lag between the frame-sync and delayed frame- 
sync signals. The minimum value for the frame-sync delay register is 18 (deci- 
mal). This is because every data transfer requires 16 clock periods, and the 
output data, DOUT, is placed in the high-impedance state on the 17th rising 
edge of the clock. 


Example 3-3 uses a minimum value of 18, which means that the time delay 
between the frame sync and the delayed frame sync is equal to 18 multiplied 
by the period of SCLK. With SCLK = 2.592 MHz based on an MCLK of 10.368 
MHz, the time delay is 6.94us. The hardware schematic indicates that the 
delayed frame-sync signal from the master is the frame-sync signal for the 
slave. The master, therefore, generates a frame-sync signal for itself and a 
delayed frame-sync signal for its slave. Itthen synchronizes the host serial port 
for data transfers to itself and its slaves. All interrupts on the host are disabled 
during the time the ’ACO1s are programmed. This disallows the host from re- 
ceiving or sending any data to or from the ’ACO1s. 


The output data (DOUT) from the ’ACO1 operating in the master mode is avail- 
able at the falling edge of the master frame-sync signal. In the primary commu- 
nication interval, this data represents the 14-bit A/D conversion result. The two 
least significant bits (LSBs), D01 and DOO, determine whether the data is gen- 
erated by the master or its slave as follows: 


DO1 DOO 
Master mode 0 0 
Slave mode 0 1 


In the secondary communication interval, the data available at DOUT repre- 
sents the contents of the register read with the eight MSBs set to 0 and the 
read/write bit set to logic 1 during the primary interval. If no register read is re- 
quested, the second word is all Os. 


The output data at DOUT from the ’ACO1 operating in slave mode is available 
at either the falling edge of the external frame sync signal (which is the delayed 
frame sync from the master) or the rising edge of the external SCLK signal, 
whichever comes first. 
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When the ACO01 receives an interrupt, the service routine first determines 
whether the received data is from the master or the slave by checking the LSBs 
of the received data at bit positions DO1 and DOO. If the LSBs are 00, the re- 
ceived data is stored in a memory buffer, which holds all data received from 
the master. The service routine also sends data from a memory buffer, which 
holds all data going the master. If the LSBs of the received data are 01, data 
is received from and sent to the slave ’AC01, using dedicated memory buffers. 
This can be used with double buffering or circular buffers to allow the ISR to 
notify the main routine. This occurs when new input data is available for pro- 
cessing and/or space is available to accept processed data for output to or 
from the master and the slave. 


Example 3-3. Master-Slave Mode 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK 


+ + + + F FF FF F HF FH 


, 


, 


This file includes the ACO1 registers initialization 

All registers have 2 control bits that initiates serial communication 

There are 2 communication modes - primary and secondary communications 
During primary communication the control bits DOO and DO1 are 11 to request 
for a secondary communication. In the secondary serial communications the 
control bits D15 and D14 perform same control function as primary. 

The R/W~ bit at reset is set to 0 placing the device in write mode. 

DS7-DS0O0 -> Number of SCLK’s between FS~ and FSD~. When slaves are used, 
this should be the last register to be programmed The minimum value for this 
register should be decimal 18. This means that the time interval between the 
FS~ and FSD~ with an SCLK * frequency of 2.592 MHz (with an MCLK of 10.368 MHz) 
is 18 1/2.592 MHz = 6.94 us 

DS7-DS0O0 -> # of frame sync signals generated by ACO1 

Therefore, number of frame sync signals number of slave = devices + 1 

In programming the ACO1 REG7 is the last register to be programmed if the 
configuration is master salve mode configuration 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KEK KKK KKK KKK KKK KKK KK KK 


K_ NOP_ADDR .set OQ << 8 

K_ REG_O -set K_NOP_ADDR 

K_ A ADDR .set 1 << 8 ; REG 1 address 

K_ A REG .set 36 

K_ REG_1 -set K_A_ADDR|K_A_REG ; FCLK = 144KHz => A =24h 

K_ B ADDR .set 2 << 8 ; REG 2 address 

K_B_ REG .set 18 

K REG 2 .set K_B_ADDR|K_B_REG ; Sampling rate = 8KHz 

K AA ADDR .set 3 << 8 ; Register 3 address 

K AA REG .set 0 

K REG_3 .set K_ AA ADDR|K AA REG ; no shift 

K GAIN _ADDR .set 4 << 8 ; Register 4 address 
K_MONITOR_GAIN .set O0b << 4 ; Monitor output gain = squelch 
K_ ANLG_IN_GAIN .set Olb << 2 ; Analog input gain = OdB 
K_ANLG_OUT_GAIN .set Olb << 0 ; Analog output gain = OdB 

K_ REG_4 -set K_GAIN_ADDR|K_MONITOR GAIN|K ANLG_IN GAIN|K ANLG_OUT_GAIN 
K_ANLG_CNF_ADDR .set 5 << 6 ; Register 5 address 

K_ ANLG RESRV .set 0 << 3 ; Must be set to 0 

K HGH FILTER .set Q << 2 ; High pass filter is enabled 
K_ENBL_IN .set Olb << 0 ; Enables IN+ and IN- 
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Example 3-3. Master-Slave Mode (Continued) 


K_REG_5 


K_DGTL_CNF_ADDR 


K_ADC_DAC 
K_FSD_OUT 


K_16_BIT_COMM 
K_SECND_COMM 
ET 
K_POWER_DWN 
K_REG_HIGH_6 


K_SOFT_RES 


K_REG_LOW_ 


6 


.set 
.set 
-set 
set 
Set 
+set 
.set 
«Set 
set 
Set 
»set 
.set 
«Set 
set 
-set 
-set 
.set 
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ESRV|K_HGH_FILTER|K_ENBL_IN 


RNORFPARRROCCCCOOCAN 


ANLG_CNF ADDR|K ANLG_R 

<< 8 ; Register 6 address 

<< 5 ; ADC and DAC is inactive 

<< 4 ; Enabled FSD output 

<< 3 ; Normal 16-bit mode 

<< 2 ; Normal secondary communication 
<< <1 ; Inactive reset 

<< 0 ; Power down external 
DGTL_CNF_ADDR|K_ADC_DAC|K_FSD_OUT|K_16_BIT_COMM 
SECND_COMM | K_SOFT RESET | K_POWER_DWN 
REG_HIGH_6]|K_REG_LOW_6 

<< 8 ; Register 7 address 

8 << 8 ; 

FRME_SYN. ADDR|K FRME_SYN 

<< 8 ; Register 8 address 

<< 8 ; 

FRME_NUM ADDR|K FRME_NUM 


; primary word with DO1 and DOO bits set to 11 will cause a 
7 secondary communications interval to start when the frame 
7 sync goes low next 


K_SCND_CONTRL 
AIC_REG_START_LIST 
ATC_R 
K_REG_1 
K_REG_2 
K_REG_3 
K_REG_4 
K_REG_5 
K_REG_6 
K_REG_8 
K_REG_7 


.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


-set 


AIC_REG_END_LIST 


K_XRDY 


3-18 


-set 


llb << 0 


Sect 


“aic_reg” 


0800h 


; Secondary 


comm.bits 


; includes the aic table 
EG END_LIST-AIC_REG_START_LIST-1 


; this should be the last 
; register to be programmed 


; XRDY bit in SPC1 
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3.3 Software Stack 


A’54x device has a software stack whose location is determined by the con- 
tents of the stack pointer (SP). When using the software stack, you must 
accommodate anticipated storage requirements. The system stack fills from 
high to low memory addresses, as shown in Figure 3-3. The SP always points 
to the last element pushed onto the stack. A push performs a predecrement 
and a pop performs a postincrement of the system stack pointer. The stack is 
used during subroutine calls and returns and inside the subroutine as tempo- 
rary storage. 


The CALL, CALLD, CC, CCD, CALA, and CALAD instructions push the value 
of the program counter (PC) onto the stack. The RET and RETD instructions 
pops the stack and places its value into the program counter. The contents of 
any memory-mapped register can be pushed onto and popped from the stack, 
using the PSHM and POPM instructions. Two additional instructions, PSHD, 
and POPD, are included in the instruction set so that data memory values can 
be pushed onto and popped from the stack. 


Figure 3-3. System Stack 
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Low memory 


Free 


SP Top of stack 


Bottom of stack 


High memory 
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3.4 Context Switching 
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Before you execute a routine, you must save its context and restore the context 
after the routine has finished. This procedure is called context switching, and 
involves pushing the PC onto the stack. Context switching is useful for subrou- 
tine calls, especially when making extensive use of the auxiliary registers, 
accumulators, and other memory-mapped registers. 


Due to system and CPU requirements, the order of saving and restoring can 
vary. Some repeat instructions, such as RPTB, are interruptible. To nest repeat 
block instructions, you must ensure that the block-repeat counter (BRC), 
block-repeat start address (RSA), and block-repeat end address (REA) regis- 
ters are saved and restored. 


You must also ensure that the block-repeat active flag (BRAF) is properly set. 
Since the block-repeat flag can be deactivated by clearing the BRAF bit of the 
ST1 register, the order in which you push the block-repeat counter and ST1 
is important. If the BRC register is pushed onto the stack prior to ST1, any PC 
discontinuity in RPTB can give a wrong result, since BRAF is cleared in ST1. 
Thus, you must restore BRC before restoring the ST1 register. 


A context save complements the restored contents. To ensure the integrity of 
the code, determine what contents must be restored so that no sequencing is 
lost. 


PRELIMINARY 


PRELIMINARY Context Switching 


Example 3-4. Context Save and Restore for TMS320C54x 


-title “CONTEXT SAVE/RESTORE on SUBROUTINE or INTERRUPT 
CONTEXT_RESTORE .macro 
POPM PMST ;Restore PMST register 
POPM RSA ;Restore block repeat start address 
POPM REA ;Restore block repeat end address 
POPM BRC ;Restore block repeat counter 
POPM IMR ;Restore interrupt mask register 
POPM BK ;Restore circular size register 
POPM ST1 ,Restore ST1 
POPM STO ,Restore STO 
POPM ARO ,Restore ARO 
POPM ARI ;Restore AR1 
POPM AR2 ;Restore AR2 
POPM AR3 ,;Restore AR3 
POPM AR4 ;Restore AR4 
POPM AR5 ,Restore ARS 
POPM AR6 ,Restore AR6 
POPM AR7 ;Restore AR7 
POPM T ;Restore temporary register 
POPM TRN ;Restore transition register 
POPM BL ;Restore lower 16 bits of accB 
POPM BH ;Restore upper 16 bits of accB 
POPM BG ;Restore 8 guard bits of accB 
POPM AL ;Restore lower 16 bits of accA 
POPM AH ;Restore upper 16 bits of accA 
POPM AG ;Restore 8 guard bits of accA 
.endm 
CONTEXT_SAVE -macro 
PSHM AG ;Save 8 guard bits of accA 
PSHM AH ;Save upper 16 bits of accA 
PSHM AL ;Save lower 16 bits of accA 
PSHM BG ;Save 8 guard bits of accB 
PSHM BH ;Save upper 16 bits of accB 
PSHM BL ;Save lower 16 bits of accB 
PSHM TRN ;Save transition register 
PSHM T ;Save temporary register 
PSHM AR7 ;Save AR7 
PSHM AR6 ;Save AR6 
PSHM AR5 ;Save AR5 
PSHM AR4 ;Save AR4 
PSHM AR3 ;Save AR3 
PSHM AR2 ;Save AR2 
PSHM ARI ;Save ARI 
PSHM ARO ;Save ARO 
PSHM STO ;Save STO 
PSHM ST1 ;Save STl 
PSHM BK ;Save circular size register 
PSHM IMR ;Save interrupt mask register 
PSHM BRC ;Save block repeat counter 
PSHM REA ;Save block repeat end address 
PSHM RSA ;Save block repeat start address 
PSHM PMST ;Save PMST register 
.endm 
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The ’54x CPU supports 16 user-maskable interrupts. The vectors for interrupts 
not used by a 54x device can function as software interrupts, using the INTR 
and TRAP instructions. TRAP and INTR allow you to execute any of the 32 
available ISRs. You can define other locations in the interrupt vector table. The 
INTR instruction sets the INTM bit to 1, clears the corresponding interrupt flag 
to 0, and makes the IACK signal active, but the TRAP instruction does not. 
INTR and TRAP are nonmaskable interrupts. 


When a maskable interrupt occurs, the corresponding flag is set to 1 in the in- 
terrupt flag register (IFR). Interrupt processing begins if the corresponding bit 
in IMR register is set to 1 and the INTM bit in the ST1 register is cleared. The 
IFR register can be read and action taken if an interrupt occurs. This is true 
even when the interrupt is disabled. This is useful when not using an interrupt- 
driven interface, such as in a subroutine call when INT1 has not occurred. 


When interrupt processing begins, the PC is pushed onto the stack and the 
interrupt vector is loaded into the PC. Interrupts are then disabled by setting 
INTM = 1. The program continues from the address loaded in the PC. Since 
allinterrupts are disabled, the program can be processed without any interrup- 
tions, unless the ISR reenables them. Except for very simple ISRs, it is impor- 
tant to save the processor context during execution of the routine. 


During the time the ‘ACO01 is reset, the DSP initializes the serial port and sets 
up the interrupt. To set up the interrupts, it performs the following operations: 


[1 Enables unmasked interrupts by clearing the interrupt mode bit (INTM) 


[j Clears prior receive interrupts by writing the current contents of the 
appropriate receive interrupt flag in the IFR back to the IFR 


_j Enables receive interrupts by setting the appropriate receive interrupt flag 
in the interrupt mask register (IMR) 


The initialization of the IMR and IFR registers and the INTM bit is included in 
the serial port and the ’ACO01 initialization. 


Example 3-5 processes the receive interrupt 1 service routine. The routine 
collects 256 samples in the first buffer and changes the address to the second 
buffer for the next 256 samples while processing the first buffer. 
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Example 3-5. Receive Interrupt Service Routine 


; TEXAS INSTRUMENTS INCORPORATED 

-mmregs 

-include “”INTERRPT.INC” 

-include “main.inc” 
RCV_INT1_DP -usect "roev_vars”,0 
d_index_count -usect "rov_vars”,1 
d_rcv_in_ptr -usect "rov_vars”,1 ; save/restore input bffr ptr 
d_xmt_out_ptr -usect "rev_vars”,1 ; save/restore output bffr ptr 
d_frame_flag -usect "rov_vars”,1 
input_data -usect “inpt_buf”,K_FRAME_SIZE*2 ; input data array 
output_data -usect “outdata”,K_FRAME SIZE*2 ; output data array 

.def receive_intl 

.def d_frame_flag 

.def RCV_INT1_DP 

.def input_data, output_data 

.def d_xmt_out_ptr 

.def d_rcv_in_ptr 


, Functional Description 

: This routine services receive interruptl. Accumulator A, AR2 and AR3 

; are pushed onto the stack since AR2 and AR3 are used in other applications. 
- A 512 buffer size for both input and output. 

; After every 256 collection of input samples a flag is set to process the 

A data. No circular buffering scheme is used here. 

; After collecting 256 samples in the lst bffr, then the second buffer 

- address is loaded and collect data in the second buffer while processing 

; the first buffer and vice versa. 


-asg AR2,GETFRM_IN_P ; get frame input data pointer 
-asg AR3,GETFRM_OUT_P 7 get frame output data pointer 
-asg AR2, SAVE_RSTORE_AR2 
-asg AR3, SAVE_RSTORE_AR3 
-sect "main_prg” 
receive_intl: 
PSHM AL 
PSHM AH 
PSHM AG 
PSHM BL 
PSHM BH 
PSHM BG 
; AR2, AR3 are used in other routines, they need to be saved and restored 
; Since receive interrupt uses AR2 and AR3 as pointers 
PSHM SAVE_RSTORE_AR2 ; Since AR2 and AR3 are used 
PSHM SAVE_RSTORE_AR3 ; in other routines, they need 
PSHM BRC 
LD #RCV_INT1_DP, DP 7? Anite... DP 
MVDK d_rcv_in_ptr, GETFRM_IN_P ; restore input bffr ptr 
MVDK d_xmt_out_ptr,GETFRM_OUT_P ; restore output bffr ptr 
ADDM #1,d_index_count ; increment the index count 
LD #K_FRAME_SIZE,A 
SUB d_index_count, A 
BC get_samples, AGT ;check for a frame of samples 
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Example 3-5. Receive Interrupt Service Routine (Continued) 


frame_flag_set 


ADDM 1,d_int_count 

ST K_FRAME_FLAG,d_frame_flag ; set frame flag 

ST #0, d_index_count ; reset the counter 

LD #input_datat+K_FRAME_ SIZE,A 7 second input bffr starting addr 

LD foutput_datat+K_FRAME_SIZE,B 7 second output bffr starting addr 

BITF GAint count, 2 ; check for 1lst/2nd bffr 

BC reset_buffer,NTC 

SUB #K FRAME SIZE,A ; lst input address 

SUB #K FRAME SIZE,B ; lst output address 

ST K_0,d_int_count 
reset_buffer 

STLM A, GETFRM_IN_P ; input buffer address 

STLM B, GETFRM_OUT_P ; output buffer address 
get_samples 

LDM DRR1,A ; load the input sample 

STL A, *GETFRM_IN_P+ ; write to buffer 

LD *GETFRM_OUT_P+,A ; if not true, then the filtered 

AND #O0fffch,A ; signal is send as output 

STLM A,DXR1 ; write to DXR1 

MVKD GETFRM_IN_P,d_rcv_in_ptr ; save input buffer ptr 

MVKD GETFRM_OUT_P,d_xmt_out_ptr ; save out bffr ptr 

POPM BRC 

POPM SAVE_RSTORE_AR3 ; restore AR3 

POPM SAVE_RSTORE_AR2 ; restore AR2 

POPM BG 

POPM BH 

POPM BL 

POPM AG 

POPM AH 

POPM AL 

POPM ST1 

POPM STO 

RETE ; return and enable interrupts 

.end 
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3.6 Interrupt Priority 


Interrupt prioritization allows interrupts that occur simultaneously to be serviced 
in a predefined order. For instance, infrequent but lengthy ISRs can be inter- 
rupted frequently. In Example 3-6, the ISR for the INT 1 bit includes context save 
and restore macros. When the routine has finished processing, the IMR is re- 
stored to its original state. Notice that the RETE instruction not only pops the 
next program counter address from the stack, but also clears the INTM bit to 0. 
This enables all interrupts that have their IMR bit set. 


Example 3-6. Interrupt Service Routine (ISR) 


-title “Interrupt Service Routine” 


-mmregs 
intl: 
CONTEXT_STORE ; push the contents of accumulators and registers on stack 
STM #K_INTO, IMR ; Unmask only INTO~ 
RSBX INTM ; Enable all Interrupts 


, 


; Main Processing for Receive Interrupt 1 


; Disable all interrupts 


SSBX INTM 
CONTEXT_RESTORE 
RETE 

.end 


7 pop accumulators and registers 
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; return and enable interrupts 


There is a potential conflict between the INTM bit disable and context restore. 
If an interrupt O (INTO) occurs during context restore, the macro CON- 
TEXT_RESTORE is executed before servicing INTO. This can trigger an INTO. 
If INTM is cleared during the context restore, it branches to the INTO service 
routine. If you reenable the interrupts when INTM returns from INTO, a conflict 
occurs, because INTM is set to 0 and its original contents are lost. To preserve 
the contents of the INTM bit, do not enable the interrupts when INTM returns 
from the INTO service routine. During interrupt priorities, preserve the INTM 
and IMR bits for the system requirements. 
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3.7 Circular Addressing 


Circular addressing is an important feature of the ’54x instruction set. Algo- 
rithms for convolution, correlation, and FIR filters can use circular buffers in 
memory. In these algorithms, the circular buffers implement a sliding window 
that contains the most recent data. As new data comes in, it overwrites the old- 
est data. The size, the bottom address, and the top address of the circular buff- 
er are specified by the block size register (BK) and a user-selected auxiliary 
register (ARn). A circular buffer size of R must start on a K-bit boundary (that 
is, the K LSBs of the starting address of the circular buffer must be 0), where 
K is the smallest integer that satisfies 2K > R. 


Circular addressing can be used for different functions of an application. For 
example, it can be used for collecting the input samples in a block. It can also 
be used in processing samples in blocks and data in the output buffer. In 
Example 3-7, a frame of 256 samples is collected from the serial port to pro- 
cess the data using the circular addressing mode. The output from the pro- 
cessed block is sent to the D/A converter through the serial port register using 
circular buffers. A ping-pong buffering scheme is used. While processing the 
first buffer, samples are collected in the second buffer, and vice versa. The 
real-time operation of the system is not disturbed and no data samples are lost. 


Example 3-7. Circular Addressing Mode 


; TEXAS INSTRUM 

-mmregs 

- include 

- include 
RCV_INT1_DP 
d_index_count 
d_rcv_in_ptr 
d_xmt_out_ptr 
d_frame_flag 
input_data 
output_data 

.def 

.def 


ENTS INCORPORAT 


“INTERRPT. INC’ 
"main.inc” 


ED 


Ud 


-usect "“rev_vars”,0 
-usect "“rev_vars”,1 
-usect "“rev_vars”,1 
-usect "“rev_vars”,1 
-usect “rev_vars”,1 


-usect “inpt_buf”,K_FRAME_SIZE*2 


-usect “outdat 
receive_intl 
d_frame_flag 
RCV_INT1_DP 


a”,K_FRAME_SIZE*2 


input_data, output_data 


d_xmt_out_ptr 
d_rcv_in_ptr 


Functional Description 


This routine services receive interruptl. 


save/restore input bffr ptr 
save/restore output bffr ptr 


input data array 
output data array 


Accumulator A, AR2 and AR3 


A 512 buffer size of both input and output uses circular addressing. 
After every 256 collection of input samples a flag is set to process the 


a 
, 
, 
; are pushed onto the stack since AR2 and AR3 are used in other applications. 
, 
, 
z 


data. A PING/PONG buffering scheme is used such that upon processing 


; PING buffer, 


samples are collected in the PONG buffer and vice versa. 
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Example 3-7. Circular Addressing Mode (Continued) 


a 


a 


dtob 


get frame input data pointer 
get frame output data pointer 


saved and restored 


they n 


, 


a 


Nee Ne Ne Ne Ne 


inters 
Since AR2 and AR3 are used 
in other routines, they need 


circular buffer size of in,out 
arrays 

init. DP 

restore input circular bffr ptr 
restore output circular bffr ptr 
increment the index count 


;check for a frame of samples 


Ne Ne Ne Ne 


a 


-asg AR2,GETFRM_IN_P 

-asg AR3,GETFRM_OUT_P 

-asg AR2, SAVE_RSTORE_AR2 

-asg AR3, SAVE_RSTORE_AR3 

-sect “main_prg” 
receive_intl: 

PSHM AL 

PSHM AH 

PSHM AG 

PSHM BL 

PSHM BH 

PSHM BG 
; AR2, AR3 are used in other routines, 
; Since receive interrupt uses AR2 and AR3 as po 

PSHM SAVE_RSTORE_AR2 ; 

PSHM SAVE_RSTORE_AR3 

PSHM BRC 

STM #2*K_FRAME_ SIZE, BK 

LD #RCV_INT1_DP, DP 

MVDK d_rcv_in_ptr, GETFRM_IN_P 

MVDK d_xmt_out_ptr, GETFRM_OUT_P 

ADDM #1,d_index_count 

LD #K_FRAME_SIZE,A 

SUB d_index_count, A 

BC get_samples, AGT 
frame_flag_set 

ST #K_FRAME_FLAG, d_frame_flag 

ST #0,d_index_count 
get_samples 

LDM DRR1,A 

STL A, *GETFRM_IN_P+% 

LD *GETFRM_OUT_P+%,A 

AND #O0fffch,A 

STLM A,DXR1 

MVKD GETFRM_IN_P,d_rcv_in_ptr 

MVKD GETFRM_OUT_P,d_xmt_out_ptr 

POPM BRC 

POPM SAVE_RSTORE_AR3 

POPM SAVE_RSTORE_AR2 

POPM BG 

POPM BH 

POPM BL 

POPM AG 

POPM AH 

POPM AL 

POPM ST1 

POPM STO 

RETE 

end 
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set frame flag 
reset the counter 


load the input sample 

write to buffer 

if not true, then the filtered 
signal is send as output 

write to DXR1 

save input circular buffer ptr 
save out circular bffr ptr 


restore AR3 
restore AR2 


return and enable interrupts 


Analog I/O 3-27 


Buffered Serial Port 


PRELIMINARY 


3.8 Buffered Serial Port 


The buffered serial port (BSP) is made up of a full-duplex, double-buffered 
serial port interface, which functions in a similar manner to the ’54x standard 
serial port, and an autobuffering unit (ABU). The serial port section of the BSP 
is an enhanced version of the 54x standard serial port. The ABU is an addition- 
al section of logic which allows the serial port section to read/write directly to 
54x internal memory independent of the CPU. This results in a minimum over- 
head for serial port transfers and faster data rates. The full duplex BSP serial 
interface provides direct communication with serial devices such as codecs, 
serial A/D converters, and other serial devices with a minimum of external 
hardware. The double-buffered BSP allows transfer of a continuous commu- 
nication stream in 8-,10-,12- or 16-bit data packets. This section uses the 542 
device to verify the BSP functionality. 


The autobuffering process occurs between the ABU and the 2K-word block of 
ABU memory. Each time a serial port transfer occurs, the data involved is 
automatically transferred to or from a buffer in the 2K-word block of memory 
under control of the ABU. No interrupts are generated with each word transfer 
in autobuffering mode. Interrupts are generated to the CPU each time one of 
the half-boundaries is crossed. When autobuffering capability is disabled 
(standard mode), transfers with the serial port are performed under user con- 
trol (software). When autobuffering is enabled, word transfers can be done di- 
rectly between the serial port and ’54x internal memory using the ABU em- 
bedded address generators. The ABU has its own set of circular addressing 
registers with corresponding address generation units. The length and starting 
addresses of the buffers are user programmable. A buffer empty/full interrupt 
can be posted to the CPU. 


The six MSBs in the BSP control extension register (BSPCE) configure the 
ABU. Bits 14 and 11 are read only and the remaining bits are read/write. 
Figure 3—4 shows the bit positions of BSPCE and Table 3—1 provides a sum- 
mary of each bit. 


Figure 3—4. BSP Control Extension Register (BSPCE) Diagram 


14 


12 11 4-0 


R R/W 
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Table 3—1. BSP Control Extension Register (BSPCE) Bit Summary 


Bit Name 

15 HALTR 

14. RH 

13 BRE 

12. HALTX 

11 XH 
PRELIMINARY 


Function 


Autobuffering Receive Halt. This control bit determines whether autobuffering receive is 
halted when the current half of the buffer has been received. 


HALTR = 0 Autobuffering continues to operate when the current half of the buffer 
has been received. 


HALTR = 1 Autobuffering is halted when the current half of the buffer has been re- 
ceived. When this occurs, the BRE bit is cleared to 0 and the serial port 
continues to operate in standard mode. 


Receive Buffer Half Received. This read-only bit indicates which half of the receive buff- 
er has been filled. Reading RH when the RINT interrupt occurs (seen either as a pro- 
gram interrupt or by polling IFR) is a convenient way to identify which boundary has just 
been crossed. 


RH=0 The first half of the buffer has been filled and that receptions are cur- 
rently placing data in the second half of the buffer. 


RH =1 The second half of the buffer has been filled and that receptions are 
currently placing data in the first half of the buffer. 


Autobuffering Receive Enable. This control bit enables autobuffering receive. 


BRE =0 Autobuffering is disabled and the serial port interface operates in stan- 
dard mode. 
BRE = 1 Autobuffering is enabled for the receiver. 


Autobuffering Transmit Halt. This control bit determines whether autobuffering transmit 
is halted when the current half of the buffer has been transmitted. 


HALTX = 0 Autobuffering continues to operate when the current half of the buffer 
has been transmitted. 


HALTX = 1 Autobuffering is halted when the current half of the buffer has been 
transmitted. When this occurs, the BXE bit is cleared to 0 and the serial 
port continues to operate in standard mode. 


Transmit Buffer Half Transmitted. This read-only bit indicates which half of the transmit 
buffer has been transmitted. Reading XH when the XINT interrupt occurs (seen either as 
a program interrupt or by polling IFR) is a convenient way to identify which boundary 
has just been crossed. 


XH = 0 The first half of the buffer has been transmitted and transmissions are 
currently taking data from the second half of the buffer. 


XH = 1 The second half of the buffer has been transmitted and transmissions 
are currently taking data from the first half of the buffer. 
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Table 3-1. BSP Control Extension Register (BSPCE) Bit Summary (Continued) 


Bit 
10 
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Name 
BXE 


PCM 


FIG 


FE 


CLKP 


FSP 


Function 

Autobuffering Transmit Enable. This control bit enables the autobuffering transmit. 

BXE =0 Autobuffering is disabled and the serial port operates in standard mode. 

BXE = 1 Autobuffering is enabled for the transmitter. 

Pulse Code Modulation Mode. This control bit puts the serial port in pulse code modulation 

(PCM) mode. The PCM mode only affects the transmitter. BDXR-to-BXSR transfer is not 

affected by the PCM bit value. 

PCM =0 Pulse code modulation mode is disabled. 

PCM = 1 Pulse code modulation mode is enabled. In PCM mode, BDXR is trans- 
mitted only if its most significant (215) bit is set to 0. If this bit is set to 1, 
BDXR is not transmitted and BDX is put in high impedance during the 


transmission period. 


Frame Ignore. This control bit operates only in transmit continuous mode with external 
frame and in receive continuous mode. 


FIG =0 Frame sync pulses following the first frame pulse restart the transfer. 


FIG = 1 Frame sync pulses following the first frame pulse that initiates a transfer 
operation are ignored. 


Format Extension. The FE bit in conjunction with FO in the SPC register specifies the word 
length. When FO FE = 00, the format is 16-bit words; when FO FE = 01, the format is 10-bit 
words; when FO FE = 10, the format is 8-bit words; and when FO FE = 11, the format is 


12-bit words. Note that for 8-, 10-, and 12-bit words, the received words are right justified 
and the sign bit is extended to form a 16-bit word. Words to transmit must be right justified. 


Clock Polarity. This control bit specifies when the data is sampled by the receiver and trans- 
mitter. 


CLKP = 0 Data is sampled by the receiver on BCLKR falling edge and sent by the 
transmitter on BCLKX rising edge. 


CLKP = 1 Data is sampled by the receiver on BCLKR rising edge and sent by the 
transmitter on BCLKX falling edge. 


Frame Sync Polarity. This control bit specifies whether frame sync pulses (BFSX and 
BFSR) are active high or low. 


FSP =0 Frame sync pulses (BFSX and BFSR) are active high. 


FSP = 1 Frame sync pulses (BFSX and BFSR) are active low. 
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Table 3-1. BSP Control Extension Register (BSPCE) Bit Summary (Continued) 


Bit Name Function 


4-0 CLKDV Internal Transmit Clock Division factor. When the MCM bit of BSPC is set to 1, CLKX is driv- 
en by an on-chip source having a frequency equal to 1/(CLKDV+1) of CLKOUT. CLKDV 
range is 0-31. When CLKDV is odd or equal to 0, the CLKX duty cycle is 50%. When CLKDV 
is an even value (CLKDV=2p), the CLKX high and low state durations depend on CLKP. 
When CLKP is 0, the high state duration is p+1 cycles and the low state duration is p cycles; 
when CLKP is 1, the high state duration is p cycles and the low state duration is p+1 cycles. 


The autobuffering process for transmit is illustrated in Figure 3-5 and for 
receive in Figure 3-6. When a process is activated upon request from the seri- 
al port (XRDY = 1 or RRDY = 1), four actions are performed: 


1) ’54x internal memory access, 
2) address register update, 

3) decision for interrupt, 

4) autodisabling management. 


An interrupt is generated whenever the first or second half-of-buffer is pro- 
cessed. The RH and XH bits in BSPCE allow you to know which half has been 
processed when an interrupt boundary is found. For further details on BSP 
operation, refer to the TMS320C54x DSP Reference Set, Volume 1: CPU and 
Peripherals. 
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Figure 3-5. Autobuffering Process for Transmit 


DXR = Mem (AXR);memory access 
AY Rib= 1 ‘increment address 
If (AXRL=BKXL>>1) OR (AXRL=BKL) Then 


Generate XINT; Buffer boundary 
If (AXRL=BKXL>>1) Then 


XH = 0 ;first half emptied 


XRDY = 1 
&BXE =1 


& XRST = 1 


Else 
AXL=0_ ;buffer wrap around 


XH=1  :second half emptied 
Endif 
If (HALTX = 1) Then 

BXE =0 _ ;disconnect ABU 


Endif 
Endif 
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Figure 3-6. Autobuffering Process for Receive 


Mem (ARR) =DRR_ ;memory access 


ARRL = +1 increment address 


lf (ARRL=BKRL>>1) OR (ARRL=BKRL) Then 


RRDY = 1 Generate RINT; Buffer boundary 


ae If (ARRL=BKRL>>1) Then 


RH = 0 _ -first half filled 
Else 
ARL = 0 ;buffer wrap around 
RH =1_ ;second half filled 
Endif 
If (HALTR = 1) Then — ;auto disabling 


BRE =0 disconnect ABU 
Endif 


Endif 
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In order to start or restart BSP operation in standard mode, the same steps are 
performed in software as with initializing the serial port (see TMS320C54x 
DSP Reference Set, Volume 1: CPU and Peripherals), in addition to which, the 
BSPCE register must be initialized to configure any of the enhanced features 
desired. To start or restart the BSP in autobuffering mode, a similar set of steps 
must also be performed, in addition to which, the autobuffering registers must 
be initialized. 


As an illustration of the proper operation of a buffered serial port, Example 3-8 
and Example 3-9 define a sequence of actions. The ’542 peripheral configura- 
tion has been used as a reference for these examples. The examples show 
the actions for initializing the BSP for autobuffering mode operation. In both 
cases, assume that transmit and receive interrupts are used to service the 
ABU interrupts. 


Example 3-8 initializes the serial port for transmit operations only, with burst 
mode, external frame sync, and external clock selected. The selected data for- 
mat is 16 bits, with frame sync and clock polarities selected to be high true. 
Transmit autobuffering is enabled by setting the BXE bit in the BSPCE, and 
HALTX has been set to 1, which causes transmission to halt when half of the 
defined buffer is transmitted. 


Example 3-9 initializes the serial port for receive operations only. Receive au- 
tobuffering is enabled by setting the BRE and HALTR bits in the BSPCE to 1. 


The complete initialization code is shown in Example 3-10 on page 3-36. 
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Example 3-8. BSP Transmit Initialization Routine 


1) 


Action 


Reset and initialize the serial port 
by writing 0008h to SPC. 


Clear any pending serial port in- 
terrupts by writing 0020h to IFR. 


Enable the serial port interrupts 
by ORing 0020h with IMR. 


Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 


Initialize the ABU transmit by 
writing 1400h to BSPCE. 


Write the buffer start address to 
AXR. 


Write the buffer size to BKX. 


Start the serial port by writing 
0048h to SPC. 


Description 


This places both the transmit and receive portions of the serial port 


in reset and sets up the serial port to operate with externally gener- 
ated FSX and CLkKX signals and FSX required for transmit/receive of 
each 16-bit word. 


Eliminate any interrupts that may have occurred before initialization. 


Enable transmit interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This causes the autobuffering mode to stop when the current half-of- 
buffer has been transmitted. 


Identify the transmit buffer register address. 


Identify the buffer size of the ABU. 


This takes the transmit portion of the serial port out of reset and starts 
operations with the conditions defined in steps 1 and 5. 


Example 3-9. BSP Receive Initialization Routine 


1) 


Action 


Reset and initialize the serial port 
by writing 0000h to SPC. 


2) Clear any pending serial port in- 
terrupts by writing 0010h to IFR. 

3) Enable the serial port interrupts 
by ORing 0010h with IMR. 

4) Enable interrupts globally (if nec- 
essary) by clearing the INTM bit 
in ST1. 

5) Initialize the ABU transmit by 
writing AO0Oh to BSPCE. 

6) Write the buffer start address to 
ARR. 

7) Write the buffer size to BKR. 

8) Start the serial port by writing 
0088h to SPC. 
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Description 


This places both the transmit and receive portions of the serial port 


in reset and sets up the serial port to operate with externally gener- 
ated FSR and CLKR signals and FSR required for transmit/receive 
of each 16-bit word. 


Eliminate any interrupts that may have occurred before initialization. 


Enable receive interrupts. 


Interrupts must be globally enabled for the CPU to respond. 


This causes the autobuffering mode to stop when the current half-of- 
buffer has been received. 


Identify the receive buffer register address. 


Identify the buffer size of the ABU. 


This takes the receive portion of the serial port out of reset and starts 
operations with the conditions defined in steps 1 and 5. 
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Example 3-10. BSP initialization Routine 


-mmregs 
K_STACK_SIZ!I -set 100 
STACK -usect “stack”, K_STACK_SIZE 
SYSTEM_STACK .set STACK_K_STACK_SIZE 
.def SYSTEM_STACK 
.ref main_start 
.ref bsp_receive_int 
.ref bsp_transmit_int 
-sect “vectors” 


eS 


reset: BD main_start ; RESET vector 
STM #SYSTEM_STACK, SP 

nmi: RETE 
NOP 
NOP 
NOP ,NMI~ 

; software interrupts 

sint17 -space 4*16 

sint18 -space 4*16 

sintl19 -space 4*16 

sint20 -space 4*16 

sint21 -space 4*16 

sint22 -space 4*16 

sint23 -space 4*16 

sint24 -space 4*16 

sint25 -space 4*16 

sint26 -space 4*16 

sint27 -Sspace 4*16 

sint28 -space 4*16 

sint29 -space 4*16 

sint30 -space 4*16 

intO: RETE 
NOP 
NOP ; INTO 
NOP 

intl: RETE : 
NOP 
NOP ; INTL 
NOP 

int2: RETE 
NOP 
NOP 
NOP 

tint: RETE 
NOP 
NOP ; TIMER 
NOP 

brint: BD bsp_receive_int ; Buffered serial port receive intr 
PSHM STO 
PSHM ST1 

bxint: BD bsp_transmit_int ; Buffered serial port transmit intr 
PSHM STO 
PSHM ST1 

trint: RETE ; TDM serial port transmit intr 
NOP 
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Example 3-10. BSP initialization Routine (Continued) 


txint: RETE ; TDM serial port transmit interrupr 


int3: RETE 


NOP ; INT3 


hpiint:RETE , HPI interrupt 


NOP 
.end 
-mmregs 
- include “interrpt.inc” 
- include “init_ser.inc” 
K_AUTO_BFFR_SIZE .set 8 
rtop_bffr -usect “auto_rcv”,K_AUTO_BFFR_SIZE ; starting address of receive bffr 
xtop_bffr -usect “auto_xmt”,K_AUTO_BFFR_SIZE ; starting address of transmit bffr 
.def rtop_bffr 
.def xtop_bffr 
.def serial_init 
: This routine initializes the BSP of 542. The serial port is put 
; in reset by writting 0’s to RRST and XRST bits and pulled out of reset by 
; writting 1’s to both RRST and XRST bits. The BSPCE register is init such 
; that autobuffer is enabled. Also HALTX and HALTR are enabled to halt the 
; autobuffering scheme whenever half buffer is either received or transmitted 


a 
-sect “ser_cnfg” 
serial_init: 
KKEKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KEKE 


* We need atleast 12 cycles to pull the AIC out of reset. 


KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KK 


RSBX XF ; Put the ACO1 in reset 

STM #K_SERIAL RST, BSPC ;reset the serial port with 
70000 0000 0000 1000 

STM #K_ABUC, BSPCE ; enable auto-buffer 

STM #K_SERIAL_OUT_RST, BSPC ;bring ser.port out of reset with 
70000 0000 1100 1000 

RSBX INTM ; Enable interrupts 

LD #0,DP 

ORM # (K_BXINT |K_BRINT) , IMR ; Enable both BSP receive 


; transmit interrupt 
7 0000 0000 0011 0000 
STM # (K_BXINT|K_BRINT),IFR ; clear BXINT, BRINT 
; Pull the ACO1 out of reset - the ACO1 requires that it be held in reset for 
; 1 MCLK, which is equivalent to 96.45ns (based on an MCLK of 10.368MHz) 
SSBX XF ; Pull ACO1 out of reset 
RET 
.end 
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Example 3-10. BSP initialization Routine (Continued) 


KEK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKKKKKKKKK KKK KKK KK KKK 
* FILENAME: SERIAL_INIT.INC” 
* This include file includes the BSPC register configuration 

SPC Register Organization 


, 
* 
* 
* 
* 
* 
* staeceaoeanes 
* 
* 
* 
* 
* 


| 13 | 12 | 11 | 10 | 9 | 8 | 
|-------- | ---------| -==---]--=---|----]----- 
| RSRFULL| XSREMPTY| XRDY | RRDY | IN1| INO | 


KKK KKK KKK KKK KKK KK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKK 


,Bit Name 


rat Reserved 
31 DLB 
2 FO 

r 

73 FSM 
r 

74 MCM 
r 

pd TXM 
, 

76 XRST 
r 

37 RRST 
r 

78 INO 
79 IN1 


710 RRDY 
;1l XRDY 
312 XSREMPTY 


713 RSRFUL 


714 SOFT 
315 FREE 


, 

K_O 
K_RRST 
, 
K_XRST 
; 
K_TXM 
K_MCM 
K_FSM 
K_ZERO 
K_SERIAL_RST 


‘4 
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Function 
Always read as 0 
Digital loop back : 0 -> Disabled, 1 _. Enabled 


Format bit: 0 - > data transfered as 8 bit bytes, 
1 -> 16 bit words 

Frame sync pulse: 0 -> serial port in continuous mode, 

1 -> FSM is required 
Clock mode bit: 0 -> CLKX obtained from CLKX pin 

1-> CLKX obtained from CLKX 

Transmit mode bit: 0 -> Frame sync pulses generated externally 
1-> Internally generated frame sync 
Transmit reset bit:0 -> reset the serial port, 

1-> bring serial port out of reset 
Receive reset bit: 0 -> reset the serial port, 

1-> bring serial port out of reset 
Read-only bit reflecting the state of the CLKR pin 
Read-only bit reflecting the state of the CLKX pin 
Transition from 0 to 1 indicates data is ready to be read 
Transition from 0 to 1 indicates data is ready to be sent 
Transmit shift register empty ( Read-only) 

0 -> tramsitter has experienced underflow, 

1-> has not expereinced underflow 
Receive shift register full flag (Read-only): 

0 -> Receiver has experienced overrun, 

1-> receiver has not experienced overrun 
Soft bit - 0 -> immdeiate stop, 1-> stop after word completion 
Free run bit: 0 -> behaviour depends on SOFT bit, 

1-> free run regardless of SOFT bit 

-set OO0000000b << 8 ; bits 15-8 to 0 at reset 


-set Ob << 7 ; First write to BSPC is 0 
; second write is 1 
-set Ob << 6 ; First write to BSPC is 0 


, second write is 1 
.set Ob << 5 
.set Ob << 4 
-set Ib << 3 ; Frame Sync mode 
.set 000b << 0 
.set K_0|K_RRST|K_XRST|K_TXM|K_MCM|K_FSM|K_ZERO 
, first write to BSPC regsiter 
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Example 3-10. BSP initialization Routine (Continued) 


K_RRST1 .se 
K_XRST1 .se 
K_SERIAL_OUT_RST .se 


lb << 7 ; second write to BSPC 
lb << 6 ; second write to BSPC 
K_0|K_RRST1|K_XRST1|K_TXM|K_MCM|K_FSM|K_ZERO 

7 second write to SPC1l regsiter 


cv (CT CF 


K_TRGCR_ADDR .set 14h ; Timer Control Register I/O 

; address 
K_O .set Oh 
K_8000 .set 8000h 7 set bit 15 to pull AIC out 

; of reset 
; BSPCE Register Organization 
SS a i a a a a a a a 
* 15 | 14 ]13 | 12 |411]10]9 |8 |7 | 6] 5 | 4 oO | 
+ [-==-- |----]----|------ )----]---- |----]----]----] ---- |-----]---------- | 
* |HALTR| RH | BRE| HALTX| XH | BXE| PCM| FIG| FE |CLKP| FSP | CLKDIV | 
* 


; Auto-Buffering Control Register (ABU) 


K_CLKDV .set 00000 << 0 ; External clock 

K_FSP .set 0 << 5 ; Frame sync pulses are active 
; high at reset 0 

K_CLKP .set QO << 6 ; CLKP = 0 at reset 

K_FE .set Q << 7 ; 16 bit format 

K_FIG .set OQ << 8 ; at reset =0 

K_PCM .set Q << 9 ; no PCM mode 

K_BXE .set i << 10 ; enable transmit auto-buffer 
; mode 

K_HALTX .set 1 << 12 ; auto-buffer is halted after 
; half buffer has been trans 
7 mitted 

K_BRE .set i, << 13 ; enable receive auto-buffer 

K_HALTR .set 1 << 15 ; auto-buffer is halted after 
; half buffer has been received 

K_SPIC .set| K_CLKDV|K_FSP|K_CLKP | K_FE|K_FIG|K_PCM 

K_ABUC .set K_SPIC|K_BXE|K_HALTX|K_BRE|K_HALTR 

K_XH .set 0800h ; transmit half buffer check 

K_RH set 4000h ; receive half buffer check 


* FILENAME: INTERRUPT.INC 
* This file includes the IMR and IFR configuration 
; 542Interrupt Mask Register (IMR) Organization 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KK 


K IMR _RESR .set 0000000b << 9 ; veserved space 
K_HPIINT .set ib << 8 ; Gisable HPI interrupt 
K_TXINT .set ib << 7 ; enable TDM transmit 1 
K_ TRINT .set ib << 6 ; nable TDM receive 1 
K BXINT .set ib << 5 ; enable BSP transmit 

; interrupt 
K_BRINT .set lb << 4 ; enable BSP receive intr 
K_ TINT .set ib <= 3 7 nable timer interrupt 
K_ INT2 .set lb << 2 ; enable INT2 
K_INT1 .set Ib << 1 ; enable INT1 
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Example 3-10. BSP initialization Routine (Continued) 


K_INTO 


.set lb << 0 


-mmregs 


-include “"interrpt.inc 
-include "init_ser.inc 


“ 


a 


; enable INTO 


ref rtop_bffr 
.ref wrt_cnfg ; initializes ACO1 
.def aic_init 
; 5.2 Functional Description 
; This routine disables IMR and clears any pending interrupts before 
7 initializing ACO1. The wrt_cnfg function configures the ACO1 
, 
-sect “aic_cnfg” 
aic_init: 
CALLD wrt_cnfg ; initialize ACO1 
ANDM #~ (K_BXINT|K_BRINT) , IMR ; Gdisable receive_intl 
LD #rtop_bffr,A 
STLM A,AXR ; PING/PONG buffering scheme 
STLM A, ARR ; is used 
STM #8, BKX ; transmit circular size 
STM #8, BKR ; receive circular size 
ORM # (K_BXINT|K_BRINT),IMR ; enable the RINT1, INTL 
RETD 
STM (K_BXINT|K_BRINT) , IFR 7 service any pending interrupt 
.end 
-include "aic_cfg.inc” 
.def wrt_cnfg 
; Writes new configuration data into the ACO1. Assuming a system 
; which processes speech signals and * requires the following parameters 
: Low pass filter cut-off frequency = 3.6 kHz 
; Sampling rate = 8000 Hz 
; Assume the Master clock MCLK = 10.368 MHz 
; This example demonstrates how to program these parameters 
; the registers affected are: 
; Register A which determines the division of the MCLK frequency 
: to generate the internal filter clock FCLK. 
: It also determines th 3 dB corner frequency of the low-pass filter 
; Register B which determines the division of FCLK to generate 
; the sampling (conversion) frequency 
; It also determines the -3dB corner frequency of the high-pass filter 
¥ 
asg AR1,AIC_REG _P 
-sect “aic_cnfg” 
wrt_cnfg: 
STM aic_reg_tble, AIC_REG_P j;init ARI 
RPT AIC_REG_END_LIST-AIC_REG_START_LIST 
MVPD #tATC_REG_START_LIST, *AIC_REG_P+ 7 move the table 
STM faic_reg_tble,AIC_REG_P ;init AR1 
ST™ K_REG_0O,BDXR ;primary data word - a jump start! 
wait_xrdy 
BITF BSPC, K_XRDY ;test XRDY bit in BSPC 
BC wait_xrdy,NTC ;loop if not set 
STM #K_SCND_CONTRL, BDXR ;send primary word with DO1-D00 = 11 to 
;Signify secondary communication 
LD *AIC_REG_P+,A 
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Example 3-10. BSP initialization Routine (Continued) 


STLM A,BRC ;gives the # of regsiters to be 
NOP ;initialized 
RPTB aic_cfg_complte-1 

wait_xrdyl 


BITF BSPC,K_XRDY ;test XRDY bit in BSPC 
BC wait_xrdy1,NTC ;loop if not set 
LD *ATC_REG P+,A ;Read the register contents 


STLM A, BDXR 
wait_xrdy2 


BITF BSPC, K_XRDY ;test XRDY bit in BSPC 

BC wait_xrdy2,NTC ;loop if not set 

STM #K_SCND_CONTRL, BDXR ;set to read the next register 
aic_cfg_complte ;contents 

RET 

.end 


KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KR KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


* FILENAME: AIC_CFG.INC 


* This file includes the ACO1 registers initialization 

* All registers have 2 control bits that initiates serial communication 

* There are 2 communication modes - primary and secondary communications 

* During primary communication the control bits DOO and DO1 are 11 to request 
* for a secondary communication. In the secondary serial communications the 
* control bits D15 and D14 perform same control function as primary. 

* The R/W~ bit at reset is set to 0 placing the device in write mode. 

KKK KK KK KKK KKK KKK KEK KKK KK KKK KEK KKK KKK KEK KKK KKK KEK KKK KK KK KKK KKK KK KKK KKKKKKKK 
K_NOP_ADDR -set 0 << 8 

K_REG_0O -set K_NOP_ADDR 

K_A_ADDR .set 1 << 8 ; REG 1 address 

K_A_REG -set 36 

K_REG_1 -set K_A_ADDR|K_A_REG ; FCLK = 144KHz => A =24h 

K_ B ADDR .set 2 << 8 ; REG 2 address 

K_B REG .set 18 

K_REG_2 .set K_B_ADDR|K_B_REG ; Sampling rate = 8KHz 

K_AA_ ADDR .set 3 << 6 ; Register 3 address 

K_ AA REG -set 0 

K_ REG_3 -set K_ AA ADDR|K AA REG ; ; no shift 

K GAIN ADDR .set 4 << 8 ; Register 4 address 
K_MONITOR_GAIN .set O0b << 4 ; Monitor output gain = squelch 
K_ANLG_IN_GAIN .set Olb << 2 ; Analog input gain = OdB 
K_ANLG_OUT_GAIN .set Olb << 0 ; Analog output gain = OdB 

K_ REG_4 -set K_GAIN |_ ADDR | K_] MONITOR, GAIN|K ANLG_IN GAIN|K ANLG_OUT_GAIN 
K_ANLG_CNF_ADDR .set 5 << 6 ; Register 5 address 

K ANLG RESRV .set QO << 3 ; Must be set to 0 

K_ HGH_FILTER .set Q << 2 ; High pass filter is enabled 

K_ ENBL_IN .set Olb << 0 ; Enables IN+ and IN- 

K_REG_5 -set K_ANLG_CNF_ADDR]|K_ANLG RESRV | K_HGH_FILTER|K_ENBL_IN 
K_DGTL_CNF_ADDR .set 6 << 8 ; Register 6 address 

K_ ADC_DAC .set 0 << 5 ; ADC and DAC is inactive 

K FSD_OUT .set 0 << 4 ; Enabled FSD output 
K_16_BIT_COMM .set 0 << 3 ; Normal 16-bit mode 
K_SECND_COMM .set GO << 2 ; Normal secondary communication 
K_ SOFT_RESET .set QO << 1 ; Inactive reset 

K_ POWER_DWN .set 0 << 0 ; Power down external 
K_REG_HIGH_6 .set| K_DGTL_CNF_ADDR|K_ADC_DAC|K_FSD_OUT|K_16_BIT_COMM 
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K_REG_LOW_6 .set| K_SECND_COMM|K_SOFT_RESET | K_POWER_DWN 
K_REG_6 .set. K_REG_HIGH_6|K_REG_LOW_6 
K_FRME_SYN_ADDR .set 7 << 8 ; Register 7 address 
K_FRME_SYN set 0 << 8 ; 

K_REG_7 .set. K_FRME_SYN_ADDR|K_FRME_SYN 
K_FRME_NUM_ADDR .set 8 << 8 ; Register 8 address 
K_FRME_NUM set 0 << 8 ; 

K_REG_8 .set| K_FRME_NUM_ADDR|K_FRME_NUM 


; primary word with DO1 and DOO bits set to 11 will cause a secondary 
; communications interval to start when the frame sync goes low next 


K_SCND_CONTRL .set llb << 0 ; Secondary communication request 
AIC_REG_START_LIST -sect “aic_reg” ; includes the aic table 
-word AIC_REG_END_LIST-AIC_REG_START_LIST-1 
-word K_REG_1 
-word K_REG 2 
-word K_REG_3 
-word K_REG 4 
-word K_REG_5 
-word K_REG_6 
-word K_REG_7 
-word K_REG_8 
AIC_REG_END_LIST 
aic_reg_tble -usect “aic_vars”,10 
K_XRDY ~set 0800h ; XRDY bit in BSPC 


-mmregs 
-include ”INTERRPT.INC” 
-include "init_ser.inc” 


.def bsp_receive_int 
.def bsp_transmit_int 
7 bsp transmit and receive interrupt service routine 
; This routine performs BSP receive interrupt and transmit interrupt in 
; autobuffering mode. Since HALTX and HALTR = 1, the autobuffering mode 
: is disabled thus BXE and BRE =1 to continue in autobuffering mode. 
gee oeeeboeee sete see seses pac peta eres pata ear ean - 
-sect "main_prg” 
bsp_receive_int: 
LD #0,DP 
ORM #K_BRE, BSPCE ; enable the auto-buffer mode 
POPM STl 
POPM STO 
RETE ; return and enable interrupts 


KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK 


bsp_transmit_int: 


LD #0,DP 

ORM #K_BXE, BSPCE ; enable the auto-buffer mode 
POPM ST1 

POPM STO 

RETE ; return and enable interrupts 
.end 
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-mmregs 

-tnelude “init_54x.ane” 
-include "main.inc” 
-include “"init_ser.inc” 


.ref d_auto_bffr_flag,d_auto_bffr_reg, rtop_bffr 


.ref RCV_INT1_DP 


.ref aic_init,serial_init,init_54,init_bffr_ptr_var 


.def main_start 
-sect “main_prg” 


main_start: 


CALL init_54 
CALL init_bffr_ptr_var 
CALL serial_init 
-if K_DEFAULT_ACO1 = 1 
CALLD aic_init 
LD #0,DP 
NOP 
-else 
LD #xrtop_bffr,A 
STLM A,AXR 
STLM A,ARR 
STM #8, BKX 
STM #8,BKR 
-endif 
start_loop 
LD #RCV_INT1_DP, DP 
loop: 
BITF d_auto_bffr_flag,1 
BC loop, NTC 
St #0,d_auto_bffr_flag 
LD #xrtop_bffr,A 
MVKD BSPCE, d_auto_bffr_reg 
BITF d_auto_bffr_reg,K_RH 
BC half_buffer_empty,NTC 
ADD #K_AUTO_BFFR_SIZE/2,A 
half_buffer_empty 
STLM A, AXR 
B loop 
.end 
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initialize STO,ST1 PMST and other regsiters 


init tables,vars,bffrs,ptr 
initialize serial_port 1 


Configures ACO1 
default ACO1 config 
sampling rate = 16KHz 
init. the buffers 
transmit circular size 
receive circular size 
restore the DP 


check if auto-buffering sc 
is enabled 


PING buffer address 
load the status of SPCE 
check if first is emptied 


PONG buffer 


like PING/PONG buffer sche 


heme 


me 


Analog I/O 
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Chapter 4 


Signal Processing 


Certain features of the ‘54x architecture and instruction set facilitate the 
solution of numerically intensive problems. Some examples include filtering, 
encoding techniques in telecommunication applications, and speech recogni- 
tion. This chapter discusses digital filters that use both fixed and adaptive 


coefficients and fast Fourier transforms. 
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4.1 Finite Impulse Response (FIR) Filters 


Digital filters are a common requirement for digital signal processing systems. 
There are two types of digital filters: finite impulse response (FIR) and infinite 
impulse response (IIR). Each of these can have either fixed or adaptive coeffi- 
cients. 


If an FIR filter has an impulse response, h(0), h(1), ..., h(N—-1), and x(n] repre- 
sents the input of the filter at time n, the output y(n] at time n is given by the 
following equation: 


y(n) = A(O)x(n) + A(1)x(n-1) + A(2)x(n-2) + ...4 A(N-1)x[n-(N-1)] 


Figure 4—1 illustrates a method using circular addressing to implement an FIR 
filter. To set up circular addressing, initialize BK to block length N. The locations 
for d_data_buffer, and impulse responses, COFF_FIR, must start from 
memory locations whose addresses are multiples of the smallest power of 2 
that is greater than N. Forinstance, if N = 11, the first address ford_data_buffer 
must be a multiple of 16. Thus, the lowest four bits of the beginning address 
must be 0. 


Figure 4—1. Data Memory Organization in an FIR Filter 
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Low address 


High address 


Initial input Final input 
samples samples 


Oldest input 


Circular 
queue 


Newest input 


In Example 4-1, Nis 16 and the circular buffer starts at an address whose four 
LSBs are 0. 
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Example 4—1. FIR Implementation Using Circular Addressing Mode With a Multiply 


and Accumulate (MAC) Instruction 


; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
-include “"main.inc” 

; the 16 tap FIR coefficients 


COFF_FIR_START 


.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


6Fh 
OF3h 
269h 
50Dh 
8A9h 
0C99h 
OFF8h 
11EBh 
11EBh 
OFF8h 
O0C99h 
8A9h 
50Dh 
269h 
OF3h 
6Fh 


~8éect 


"Cott fir" ; 


filter coefficients 


COFF_FIR_END 
FIR_DP 

d_filin 
d_filout 
fir_coff_table 
d_data_buffer 


,Uusect 
<Uusect 
sUSect 


"fir_vars”,0 
"Pin vars” ,L 
"fir_vars”,1 


,USeCct 
<usect 
.def 
.def 


"fir coff"”,20 

"fir bir’, 40 3 
fir_init : 
fir_task 7 


buffer size for the filter 
initialize FIR filter 
perform FIR filtering 


; Functional Description 
; This routine initializes circular buffers both for data and coeffs. 


-asg ARO, FIR_INDEX_P 
-asg AR4,FIR_DATA_P 
.asg AR5,FIR_COFF_P 
.sect "fir prog” 
pigs eames Th gies Ue 
STM #fir_coff_table, FIR_COFF_P 
RPT #K_FIR_BFFR-1 
MVPD #COFF_FIR_START, *FIR_COFF_P+ 
STM #K_FIR_INDEX, FIR_INDEX_P 
STM #d_data_buffer,FIR_DATA_P 
RPTZ A, #K_FIR_BFFR 
STL A, *FIR_DATA_ Pt 
STM # (d_data_buffer+K_FIR_BFFR-1), 
RETD 
STM #fir_coff_table, FIR_COFF_P 
PRELIMINARY 


7 move FIR coeffs from program 
; to data 


; load cir_bfr address for the 
; recent samples 


; reset the buffer 
FIR_DATA_P 
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Example 4—1. FIR Implementation Using Circular Addressing Mode with a Multiply 
and Accumulate (MAC) Instruction (Continued) 


; Functional Description 


; This subroutine performs FIR filtering using MAC instruction. 


; accumulator A (filter output) = h(n)*x(n-i) for i = 0,1...15 
r 

-asg AR6, INBUF_P 

.asg AR7, OUTBUF_P 

-asg AR4,FIR_DATA_P 

-asg AR5,FIR_COFF_P 

-sect "fir prog” 
fir_task: 
; LD #FIR_DP, DP 

STM #K_FRAME SIZE-1, BRC ; Repeat 256 times 

RP TBD fir_filter_loop-1 

STM #K_FIR_BFFR, BK ; FIR circular bffr size 

LD *TNBUF_P+, A ; load the input value 
fir filter: 

STL A, *FIR_DATA_P+% ; replace oldest sample with newest 

7 sample 

RPTZ A, (K_FIR_BFFR-1) 

MAC *FIR_DATA_P+0%,*FIR_COFF_P+0%,A ; filtering 

STH A, *OUTBUF_P+ ; replace the oldest bffr value 
fir_filter_loop 

RET 
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In asecond method, two features of the 54x device facilitate implementation 
of the FIR filters: circular addressing and the FIRS instruction. The FIR filter 
shown in Figure 4—2, with symmetric impulse response about the center tap, 
is widely used in digital signal processing applications because of its linear 
phase response. In applications such as speech processing, linear phase re- 
sponse is required to avoid phase distortion, which degrades the quality of the 
signal waveforms. The output of the filter for length N is given by: 


N/2-1 


y(n) = > h(k)[x(n— k) + x(n— (N-1+ k))] n=0,1,2 
k=0 


Figure 4-2. Block Diagram of an Nth-Order Symmetric FIR Filter 


x(n) = 
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Figure 4-3 shows the storage diagram of the input sequence for two circular buff- 
ers. To build the buffers, the value of N/2 is loaded into a circular buffer size regis- 
ter. AR4 is set to point to the top of buffer 1 and AR5 points to the bottom of buffer 
2. The data at the top of buffer 1 is moved to the bottom of buffer 2 for the delayed 
operation before storing new sample data in buffer 1. The processor then per- 
forms the adds and multiplies h(0){x(0)+x(—N+1)}. After each iteration of the filter- 
ing routine, AR4 points to the next time slot window for the data move and AR5 
points to the next input sample. For the next iteration of the filtering routine, AR4 
points to address 1 and AR5 points to address N/2-2. 


Figure 4—3. Input Sequence Storage 
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Buffer 1 Address Buffer 2 Address 


Input Sample 
0.\€ AR4 


x(0) 


Example 4—2 shows how to implement a symmetric FIR filter on the ’54x. It 
uses the symmetrical finite impulse response (FIRS) instruction and the repeat 
next instruction and clear accumulator (RPTZ) instruction together. FIRS can 
add two data values (input sequences stored in data memory) in parallel with 
multiplication of the previous addition result using an impulse response stored 
in program memory. FIRS becomes a single-cycle instruction when used with 
the single-repeat instruction. To perform the delayed operation in this storage 
scheme, two circular buffers are used for the input sequence. 
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Example 4-2. Symmetric FIR Implementation Using FIRS Instruction 


, TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
- include "main.inc” 
FIR_COFF -sect “sym_fir” ; filter coefficients 
.word 6Fh 
.word OF3h 
.-word 269h 
.word 50Dh 
.-word 8A9h 
.word 0C99h 
.-word OFF8h 
.word 11EBh 
d_datax_buffer susect “cir_bfr”,20 
d_datay_buffer -usect "“cir_bfr1”,20 
-def sym_fir_init ; initialize symmetric FIR 
def sym_fir_task 
v 
; Functional Description 
; This routine initializes circular buffers both for data and coeffs. 
v 
asg ARO, SYMFIR_INDEX_P 
asg AR4, SYMFIR_DATX_P 
asg AR5,SYMFIR_DATY_P 
sect "“sym_fir” 
sym_fir_init: 
STM #d_datax_buffer,SYMFIR_DATX_P ; load cir_bfr address 


_buffer+K_FIR_BFFR/2-1,SYMFIR_DATY_P 


STM #K_negl, SYMFIR_INDEX_P 

RPTZ A, #K_FIR_BFFR 

STL A,* SYMFIR_DATX_P+ 

STM #d_datax_buffer, SYMFIR_DATX_P 
RPTZ A, #K_FIR_BFFR 

STL A,* SYMFIR_DATY_P- 

RETD 

STM #d_datay_buffer+K_FIR_BFFR/2- 
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1, 


for the 8 most 
recent samples STM#d_datay 


load cir_bfrl address 
for the 8 old samples 
index offset - 

whenever the pointer 
hits the top of the bffr, 
it automatically hits 
bottom address of 

buffer and decrements 
the counter 


SYMFIR_DATY_P 
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Example 4-2. Symmetric FIR Implementation Using FIRS Instruction (Continued) 


; Functional Description 
;This program uses the FIRS instruction to implement symmetric FIR filter 
;Circular addressing is used for data buffers. The input scheme for the data; 
;samples is divided into two circular buffers. The first buffer contains 

and the second buffer contains samples from 


;samples from X(-N/2) 
;X(-N) to X(-N/2-1). 


to X(-1) 


AR6, INBUF_P 
AR7, OUTBUF_P 
AR4, SYMFIR_DATX_P 
AR5, SYMFIR_DATY_P 


sym_fir_task: 
STM 
RPTBD 
STM 
LD 

symmetric_fir: 
MVDD 
STL 


ADD 

RPTZ 
FIRS 
MAR 
MAR 

STH 

sym_fir_filter_loop 

RET 

.end 
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"sym_fir 


“W 


#K_FRAME 


SIZE-1,BRC 


sym_fir_filter_loop-1 
#K_FIR_BFFR/2,BK 
*INBUF_P+, B 


*SYMFTR_DA’ 


B, *SYMFI 


R_DATX_P 


r 


a 


X_P, *SYMFIR_DATY P+0% ; move X(-N/2) to X(-N) 


replace oldest sample with newest 
sample 


*SYMFIR_DATX _P+0%,*SYMFIR_DATY P+0%,A  ; add X(0)+X(-N/2-1) 


B, #(K_FI 


R_BFFR/2-1) 


*SYMFIR_DATX_P+0%, *SYMFIR_DATY_P+0%,FIR_COFF 
*+SYMFIR_DATX_P (2) % 


*SYMFIR_| 
B, *OUTB 


DATY_P+% 
UF__P + 


, 


, 


to load the next newest sample 
position for the X(-N/2) sample 
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4.2 Infinite Impulse Response (IIR) Filters 


IIR filters are widely used in digital signal processing applications. The transfer 
function of an IIR filter is given by: 
— by + ByZzT +... + Dy Z™ — Y¥(2) 


re) 1+ azt t+... + ayz% ~ x(Z) 


The transfer function has both poles and zeros. Its output depends on both in- 
put and past output. IIR filters need less computation than FIR filters. However, 
IIR filters have stability problems. The coefficients are very sensitive to coeffi- 
cient quantization. Figure 4—4 shows a typical diagram of an IIR filter. 


Figure 4-4. Nth-Order Direct-Form Type II IIR Filter 


_aN-1 


ud if 


bN-1 


Most often, IIR filters are implemented as a cascade of second-order sections, 
called biquads. The block diagram is shown in Figure 4—5. 


Figure 4—5. Biquad IIR Filter 


PRELIMINARY Signal Processing 4-9 


Infinite Impulse Response (IIR) Filters PRELIMINARY 


Example 4—3. Two-Biquad Implementation of an IIR Filter 


; TEXAS INSTRUMENTS INCORPORATED 


-mmregs 
- include "main.inc” 
-sect "ITP COLL” 


iir_table_start 
* 


* second-order section # 01 
* 


.word -26778 7A2 
.word 29529 7Al1/2 
.word 19381 7B2 
.word -23184 7Bl 
.word 19381 ; BO 


second-order section # 02 


.word -30497 7A2 
.word 31131 7Al1/2 
.word 11363 7B2 
.word -20735 7Bl 
-word 11363 7; BO 
iir_table_end 
iir_coff_table -usect "“coff_iir”,16 
IIR_DP -usect "“iir_vars”,0 
ad jAir od -usect "iir _vars”,3*2 
d_iir_y -usect “iir_vars”,1 
.def er eh go 
.def iir_task 


; ees Aone meee, Seuss pects pees aan 
; Functional Description 
; This routine initializes buffers both for data and coeffs. 


a 


-asg AR5, IIR_DATA_P ; data samples pointer 
.asg AR4, IIR_COFF_P ; IIR filter coffs pointer 
-sect ere 


iir_init: 
STM #iir_coff_table, IIR_COFF_P 


RPT #K_IIR_SIZE-1 7 move IIR coeffs from program 
MVPD #iir_table_start,*IIR_COFF_P+ ; to data 

; LD #IIR_DP, DP 
STM #d_iir_d, IIR_DATA_P s;ARS:d(n),d(n=—1),d(a-2) 
RPTZ A, #5 ;initial d(n),d(n-1),d(n-2)=0 
STL A, *IIR_DATA_P+ 
RET 
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Example 4—3. Two-Biquad Implementation of an IIR Filter (Continued) 


, Functional Description 


; This subroutine performs IIR filtering using biquad sections 

; IIR Low pass filter design 

; Filter type : Elliptic Filter 

: Filter order : 4 order (cascade: 2nd order + 2nd order) 

7 cut freq. of pass band : 200 Hz 

; cut freq. of stop band : 500 

A BO 

; sae ae A SG) SSeS XS Re a SSeS -> 

i | | | 

; | Al | Bl | 

7 + <= & == din-1) <= = => + | 

, 

; A2 B2 

; + <= *% == d(n2)) exe > + 

, 

; second order IIR 

v 
-asg AR5, I1IR_DATA_P ; data samples pointer 
-asg AR4, IIR_COFF_P ; IIR filter coffs pointer 


-asg AR6, INBUF_P 
-asg AR7, OUTBUF_P 
-asg AR1, IIR_INDEX_P 
s8e6et, ) iar” 


iir_task: 
STM #K_FRAME_ SIZE-1,BRC ; Perform filtering for 256 samples 
RPTB iir_filter_loop-1 
LD *INBUF_P+,8,A ; load the input value 
iir_filter: 
STM #d_iir_d+5,IIR_DATA_P ;AR5:d(n),d(n-1),d(n-2) 
MVPD #iir_table_start, *IIR_COFF_P+t+ ; to data 
STM #iir_coff_table, IIR_COFF_P ;AR4:coeff of IIR filter A2,A1,B2,B1,B0 


STM #K_BIQUAD-1, IIR_INDEX_P 
feedback_path: 


MAC *TIR_COFF_P+,*IIR_DATA P-,A ; inputt+d(n-2) *A2 
MAC *TIR_COFF_P,*IIR_DATA_P,A ;inputtd(n-2) *A2+d(n-1) *A1/2 
MAC *TIR_COFPF_P+, *IIR_DATA_P-,A ; A = At+td(n-1)*A1/2 
STH A, *IIR_DATA P+ ;d(n) = input+d(n-2) *A2+d (n-1) *Al 
MAR *TIR_DATA P+ 
* Forward path 
MPY *TIR_COFF_P+,*IIR_DATA P-,A ;d(n-2) *B2 
MAC *TIR_COFF_P+,*IIR_DATA _P,A ;d(n-2) *B2+d(n-1) *B1 
DELAY *IIR_DATA_P- ;d(n-2)=d(n-1) 
eloop: 
BANZD feedback_path, *IIR_INDEX_P- 
MAC *TIR_COFF_P+,*IIR_DATA _P,A ;d(n-2) *B2+d (n-1) *B1+d(n) *BO 
DELAY *IIR_DATA P- ;a(n-1) =d (n) 
STH A,d_iir_y ; output=d (n-2) *B2+d(n-1) *B1+d (n) *BO 
LD d_iir_y,2,A 7 scale the output 
STL A, *OUTBUF_P+ ; replace the oldest bffr value 
iir_filter_loop 
RET 
.end 
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4.3 Adaptive Filtering 


Some applications for adaptive FIR and IIR filtering include echo and acoustic 
noise cancellation. In these applications, an adaptive filter tracks changing 
conditions in the environment. Although in theory, both FIR and IIR structures 
can be used as adaptive filters, stability problems and the local optimum points 
of IIR filters makes them less attractive for this use. FIR filters are used for all 
practical applications. The LMS, ST||MPY, and RPTBD instructions on the 54x 
can reduce the execution time of code for adaptive filtering. The block diagram 
of an adaptive FIR filter is shown in Figure 4-6. The Adaptive filtering routine 
is shown in Example 4—4, page 4-14. 


Figure 4—6. Adaptive FIR Filter Implemented Using the Least-Mean-Squares (LMS) 
Algorithm 


z| | 2-1 | —»- ar a Desired 
response 


LMS 


On the ’54x, one coefficient can be adapted by using the least-mean-squares 
(LMS) algorithm, which is given by 


by (i+ 1) = by (i) + 2Be(i)x(i—k), 

where: 

e(i) = ali) — y(i) 

The output of the adaptive filter is given by 


N-1 


yli) =) bk x (i= K) 
k=0 
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The LMS instruction can perform a MAC instruction and an addition with 
rounding in parallel. The LMS algorithm calculates the filter output and up- 
dates each coefficient in the filter in parallel by using the LMS instruction, along 
with the ST||MPY and RPTBD instructions. For each coefficient in the filter at 
a given instant, 2Be(i) is a constant. This factor can be computed once and 
stored in the temporary register, T, to use in each of the updates. The ST||MPY 
instruction multiplies a data sample by this factor, then the LMS instruction up- 
dates a coefficient in the filter and accumulates the filtered output. Since the 
factor is stored in T, the adaptive filtering in a time-slot window is performed 
in N cycles. 


An adaptive filter can be used in modeling to imitate the behavior of a physical 
dynamic system. Figure 4—7 shows a block diagram of system identification, 
where the adaptive filter adjusts itself to cause its output to match that of the 
unknown system. H(z) is the impulse response of an unknown system; for ex- 
ample, the echo response in the case of an acoustic echo cancellation system. 
The signal x(n) trains the system. The size of the adaptive filter is chosen to 
be N, where N is the number of taps (coefficients) of the known system, W(z). 


Two circular buffers store the input sequence. AR3 points to the first buffer, 
AR2 points to the coefficients of H(z), AR4 points to the coefficients of W(z), 
and AR5 points to the second buffer. The newest sample is stored ina memory 
location that is input to the adaptive filter. The input sample is subtracted from 
the output of the adaptive filter to obtain the error data for the LMS algorithm. 
In this example, the adaptive filter output is computed for the newest sample 
and the filter coefficients are updated using the previous sample. Thus, there 
is an inherent delay between the update of the coefficient and the output of the 
adaptive filter. 


Figure 4—7. System Identification Using Adaptive Filter 
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Example 4—4. System Identification Using Adaptive Filtering Techniques 


; TEXAS INSTRUMENTS INCORPORATED 


-mmregs 
-include “"main.inc” 
scoff .sect "coeffh” 
-include “impulse.h” 
ADAPT_DP -usect “adpt_var”,0 
d_primary -usect “adpt_var”,1 
d_output -usect “adpt_var”,1 
d_error -usect “adpt_var”,1 
d_mu -usect “adpt_var”,1 
d_mu_e -usect “adpt_var”’,1 
d_new_x -usect “adpt_var”,1 
d_adapt_count -usect "adpt_var”’,1 
hcoff -usect “"bufferh”, H_FILT_SIZE ; H(z) coeffs 
weoff -usect “bufferw”, ADPT_FILT_SIZE W(z) coeffs 
xh -usect “bufferx”, H_FILT_SIZE ; input data to H(z) 
XW -usect “bufferp”, ADPT_FILT_SIZE ; input data-adaptive filter 
.def adapt_init, adapt_task 


; Functional Description 


: This subroutine moves filter coefficients from program to data space. 
; Initializes the adaptive coefficients, buffers, vars, and sets the circular 
; buffer address for processing. 


-asg ARO, INDEX_P 

-asg AR1, INIT_P ; initialize buffer pointer 
-asg AR3,XH_DATA_P ; data coeff buffer pointer 
.asg AR5,XW_DATA_P ; data coeff buffer pointer 


; for cal.y output 
-sect “filter” 
adapt_init: 
; initialize input data location, input to hybrid, with zero. 
STM #xh, INIT_P 
RPTZ A, #H_FILT_SIZE-1 
STL A, *INIT_P+ 
: initialize input data location, input to adaptive filter, with Zero. 
STM #xw, INIT_P 
RPTZ A, #ADPT_FILT_SIZE-1 
STL A, *INIT_P+ 
; initialize adaptive coefficient with Zero. 
STM #wcoff, INIT_P 
RPTZ A, #ADPT_FILT_SIZE-1 
STL A, *INIT_P+ 
7 initialize temporary storage locations with zero 
STM #d_primary, INIT_P 
RPTZ A, #6 
STL A, *INIT_P+ 
; copy system coefficient into RAM location, Reverse order STM#hcoff,INIT_P 
RPT #H_FILT_SIZE-1 
MVPD #scoff, *INIT_P+ 
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Example 4-4. System Identification Using Adaptive Filtering Techniques (Continued) 


‘ LD #ADAPT_DP,DP 7set DP now and not worry about it 
ST #K_mu, d_mu 
STM #1, INDEX_P ; increment value to be used by 


; Qual address 
; associate auxiliary registers for circular computation 
STM #xh+H_FILT_SIZE-1, XH_DATA_P ; last input of hybrid buffer 
RETD 
STM #xw+ADPT_FILT_SIZE-1,XW_DATA_P ;last element of input buffer 


, Functional Description 


; This subroutine performs the adaptive filtering. The newest sample is stored 
’ in a separate location since filtering and adaptation are performed at the 
H same time. Otherwise the oldest sample is over written before updating 

; the w(N-1) coefficient. 

- d_primary = xh *hcoff 


; d_output = xw *wcoff 
; LMS algorithm: 
: w(itl) = w(i)+d*mu_error*xw(n-i) for i = 0,1,...127 and n = 0O,1,..... 
a 
-asg AR2,H_COFF_P , H(Z) coeff buffer pointer 
-asg AR3, XH_DATA_P ; Gata coeff buffer pointer 
-asg AR6, INBUF_P ; input buffer address pointer 
-asg AR7, OUTBUF_P ; output buffer address pointer 
, for cal. primary input 
-asg AR4,W_COFF_P , W(z) coeff buffer pointer 
-asg AR5, XW_DATA_P ; Gata coeff buffer pointer 
-sect “filter” 
adapt_task: 
STM #H_FILT_SIZE, BK ; first circular buffer size 
STM #hcoff, H_COFF_P ; H_COFF_P --> last of sys coeff 
ADDM #1,d_adapt_count 
LD *INBUF_P+, A ; load the input sample 
STM #wcoff,W_COFF_P ; reset coeff buffer 
STL A, d_new_x ; read in new data 
LD d_new_x,A H 
STL A, *XH_DATA_P+0% ; store in the buffer 
RPTZ A, #H_FILT_SIZE-1 ; Repeat 128 times 
MAC *H COFF_P+0%,*XH_DATA_P+0%,A ; mult & acc:a = a+ (h * x) 
STH A,d_primary ; primary signal 
; start simultaneous filtering and updating the adaptive filter here. 
LD d_mu_e, T ; T = step_size*error 
SUB B,B , zero acc B 


STM # (ADPT_FILT_SIZE-2),BRC 7 set block repeat counter 
RPTBD l1ms_end-1 


MPY *XW_DATA _P+0%, A 7 error * oldest sample 
LMS *W_COFF_P, *XW_DATA _P , B = filtered output (y) 
; Update filter coeff 
ST A, *W_COFF_P+ 7 save updated filter coeff 
|| MPY *XW_DATA_P+03%,A ; error *x[n-(N-1) ] 
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Example 4—4. System Identification Using Adaptive Filtering Techniques (Continued) 


LMS *W_COFF_P, 


1ims_end 


STH 


ME 
3 
d 

E 


.end 


*XW_DATA_P 


A, *W_COFF_P 
*XW_DATA_P,A 


#d_new_x, *XW_DATA_P 


’ 


*W_COFF_P, *XW_DATA_P+0% 


B, d_output 
d_primary,A 
d_output,A 
A, d_error 


d_mu, T 


d_error,A 
A,d_mu_e 
d_error,A 

A, *OUTBUF_P+ 
#K_FRAME_SIZE,A 
d_adapt_count,A 
adapt_task,AGT 


#K_0,d_adapt_count 


’ 


B = accum filtered output y 
Update filter coeff 


final coeff 
x (0) *h(0) 


store the newest sample 


store the filtered output 


store the residual error signal 
A=u*e 

save the error *step_size 
residual error signal 

check if a frame of samples 


have been processed 


restore the count 


* This is an input file used by the adaptive filter program. 
system to be identified by the adaptive filter 


* The transfer function is th 
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.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


OFFFDh 
24h 

6h 
OFFFDh 
3h 

3h 
OFFE9h 
7h 

12h 
1Ch 
OFFF3h 
OFFE8h 
3h 
1Eh 


PYyPPYP SY 


-word OCh 


PRELIMINARY 


PRELIMINARY Adaptive Filtering 


Example 4-4. System Identification Using Adaptive Filtering Techniques (Continued) 


-word 2Eh 
-word 28h 
-word OFFC6h 
-word 53h 
-word OFFBOh 
-word 55h 
-word OFF36h 
-word 5h 
-word OFFCFh 
-word OFF99h 
-word 64h 
-word 41h 
-word OFFF1h 
-word OFFDFh 
-word ODih 
-word 6Ch 
-word 57h 
-word 36h 
-word OAOh 
-word OFEE3h 
-word 6h 
-word OFEC5h 
-word OABh 
-word 185h 
-word OFFF6h 
-word 93h 
-word 1Fh 
-word 10Eh 
-word 59h 
-word OFEFOh 
-word 96h 
-word OFFBFh 
-word OFF47h 
-word OFF76h 
-word OFFOBh 
-word OFFAFh 
-word 14Bh 
-word OFF3Bh 
-word 132h 
-word 289h 
-word 8Dh 
-word OFE1Dh 
-word OFE1Bh 
-word OD4h 
-word OFF69h 
-word 14Fh 
-word 2AAh 
-word OFD43h 
-word OF98Fh 
-word 451h 
-word 13Ch 
-word OFEF7h 
-word OFE36h 
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Example 4—4. System Identification Using Adaptive Filtering Techniques (Continued) 
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.word 
.word 
.word 
.-word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


80h 
OFFBBh 
OFC8Eh 
10Eh 
37Dh 
6FAh 


lh 
OF 


198h 
OFE4Ch 
OFE78 
OF215h 


47 
74 
28 
OF 


D89h 


=) 


9h 
9h 
9h 
667h 


304h 
5SF8h 
34Fh 
47Bh 


OFF7Fh 
85Bh 
OF837h 
OF77Eh 
OFF80h 


OB 


OFO3A 


9Bh 


OF 


E66 


OFE28 
OFADO 
8C3h 

OF5D6 
14DCh 
OF3A7h 
OE542 


PYPPYPD 


=) 


r=) 


10F2h 


56 


6h 


26AAh 
15Ah 
2853h 


OE 


E95h 


93 
20 


Dh 
Dh 


1230h 
238Ah 
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4.4 Fast Fourier Transforms (FFTs) 


FFTs are an efficient class of algorithms for the digital computation of the N-point 
discrete Fourier transform (DFT). In general, their input sequences are assumed 
to be complex. When input is purely real, their symmetric properties compute the 
DFT very efficiently. 


One such optimized real FFT algorithm is the packing algorithm. The original 
2N-point real input sequence is packed into an N-point complex sequence. 
Next, an N-point FFT is performed on the complex sequence. Finally the 
resulting N-point complex output is unpacked into the 2N-point complex 
sequence, which corresponds to the DFT of the original 2N-point real input. 


Using this strategy, the FFT size can be reduced by half, at the FFT cost func- 
tion of O(N) operations to pack the input and unpack the output. Thus, the real 
FFT algorithm computes the DFT ofa real input sequence almost twice as fast 
as the general FFT algorithm. The following subsections show how to perform 
a 16-point real FFT (2N = 16). 


4.4.1. Memory Allocation for Real FFT Example 
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The memory organization for the real FFT example in Chapter10, Application 
Code Examples, uses the memory configuration shown in Figure 4—8 on page 
4-20. 


The following tables give the organization of values in data memory from the 
beginning of the real FFT algorithm to its end. Initially, the original 2N-point real 
input Sequence, a(n), is stored in the lower half of the 4N-word data processing 
buffer, as shown in Figure 4—9 on page 4-21. 
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Figure 4-8. Memory Allocation for Real FFT Example 


Program Memory 


Program space 
(about 224 words) 


Data Memory 


0000h . 4 
emory-mapped registers 
005Fh Pewee 


0061h Index of twiddle table 
0062h Index of data processing buffer 


Sine table 


cosine —» Cosine table 


Data processing buffer 


(2048 words maximum) 
(bottom half serves as real FFT input 
buffer initially) 


fft_data —» 


Power spectrum 


real_fft_output —» output buffer 
(1024 words maximum) 
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Figure 4-9. Data Processing Buffer 


Data Memory 
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4.4.2 Real FFT Example 


The ’54x real FFT algorithm is a radix-2, in-place DFT algorithm. It is shown 
in the following subsections in four phases : 


1) Packing and bit-reversal of input 
2) N-point complex FFT 

3) Separation of odd and even parts 
4) Generation of final output 


Initially, any real input sequences of 16 to 1024 points can be used by simply 
modifying the constants K_FFT_SIZE and K_LOGN appropriately, defined in 
file main.inc. (The real input size is described as 2N and the FFT size in phase 
two as N.) For a 256-point real input, for example, K_FFT_SIZE must be set 
to 128, not 256, and K_LOGN must be 7, not 8. Input data is assumed to be in 
Q15 format. 


4.4.2.1 Phase 1: Packing and Bit-Reversal of Input 


4-22 


In phase 1, the input is bit-reversed so that the output at the end of the entire 
algorithm is in natural order. First, the original 2N-point real input sequence is 
copied into contiguous sections of memory labeled real_fft_input and inter- 
preted as an N-point complex sequence, d[n]. The even-indexed real inputs 
form the real part of d[n] and the odd-indexed real inputs form the imaginary 
part. This process is called packing. (n is a variable indicating time and can 
vary from 0 to infinity, while N is a constant). Next, this complex sequence is 
bit-reversed and stored into the data processing buffer, labeled fft_data. 


1) Arrange the real input sequence, a(n) forn =0, 1,2, ...n-—1, as shown 
in Figure 4—9. Divide a(n) into two sequences as shown in Figure 4—10. 
The first is the original input sequence from 0C10h to OC1Fh. The other 
is a packed sequence: 


forn=0,1,2,...,N-1 


2) Form the complex FFT input, d(n), by using r(n) for the real part and i(n) 
for the imaginary part: 


d(n) = r(n) + f i(n) 


3) Store d(n) in the upper half of the data processing buffer in bit-reversed 
order as shown in Figure 4—10 on page 4-23. 
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Figure 4-10. Phase 1 Data Memory 


OC1Fh 
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4.4.2.2 Phase 2: N-Point Complex FFT 


In phase 2, an N-point complex FFT is performed in place in the data-processing 
buffer. The twiddle factors are in Q15 format and are stored in two separate 
tables, pointed to by sine and cosine. Each table contains 512 values, corre- 
sponding to angles ranging from 0 to almost 180 degrees. The indexing scheme 
used in this algorithm permits the same twiddle tables for inputs of different sizes. 
Since circular addressing indexes the tables, the starting address of each table 
must line up to an address with Os in the eight LSBs. 


1) Perform an N-point complex FFT on d(n). The resulting sequence is 
D[k] = F{d(n)} = Rk] + j I[k] 
where R[k] and I[k] are the real and imaginary parts of D[k], respectively. 
2) Since the FFT computation is done in place, the resulting sequence, D[k], 
occupies the upper half of the data-processing buffer, as shown. The lower 


half of the data processing buffer still contains the original real input 
sequence, a(n). This is overwritten in phase 3. 


3) Allthe information from the original 2N-point real sequence, a(n), is contained 
in this N-point complex sequence, D[k]. The remainder of the algorithm un- 
packs D[k] into the final 2N-point complex sequence, A[k] = F{a(n)}. 
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Figure 4—11.Phase 2 Data Memory 
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4.4.2.3 Phase 3: Separation of Odd and Even Parts 
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Phase 3 separates the FFT output to compute four independent sequences: 
RP, RM, IP, and IM, which are the even real, odd real, even imaginary, and the 
odd imaginary parts, respectively. 


1) 


D[k] is separated into its real even part, RP[k], real odd part, RM[k], imagi- 
nary even part, IP[k], and imaginary odd part, IM[k,] according to the fol- 
lowing equations: 


RP{k] = RP[N-K] = 0.5 * (A[K] + REN-K]) 
RMA = -RMIN-K] = 0.5 * (RIK] — R[N-A)) 
IP{K| = IPIN-K] = 0.5 * (fK] + IN-K)) 

IM{K] = -IM[N-K] = 0.5 * (i[K] — I[N-K)) 
RP{O] = ALO] 

IP{O] = 0] 

RM] = IM[0] = RM[N2] = IM[N2] = 0 
RPIN2] = PRINZ] 

IPIN2] = [N2] 


The table below shows the organization of the values at the end of phase 
three. The sequences RP[k] and IP[k] are stored in the upper half of the 
data processing buffer in ascending order; the sequences RM[k] and IM[k] 
are stored in the lower half in descending order. 
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Figure 4—12. Phase 3 Data Memory 


RP(0] = ld] 
4] 


[econ [ries 
[ocian [wel 
[ocisn [Awe 
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4.4.2.4 Phase 4: Generation of Final Output 


Phase 4 performs one more set of butterflies to generate the 2N-point complex 
output, which corresponds to the DFT of the original 2N-point real input se- 
quence. The output resides in the data processing buffer. 


1) The four sequences, RP[k], RM[k], IP[k], and IM[k], are used to compute 
the real FFT of a(n) according to the following equations. 


AR{k] = AR[2N — k] = RP[k] + cos(k ® /N) * IP[k] — sin(k ™ /N)*RM{K 
Ak] = —Al[2N — k] = IM[K| — cos(k ™ /N) * RM{K] - sin(k ™ /N) * IP{K] 
AR{0] = RP[0] + /P{O] 
Al0] = IM[O] — RM{(O} 
ARN] = AlO] — 10] 
ALN = 0 
where: 
A[k] = A[2N-K] = ARTA] + j Al[k] = Ffa(n)} 

2) The real FFT outputs fill up the entire 4N-word data processing buffer. 
These outputs are real/imaginary, interleaved, and in natural order, as 
shown in Figure 4—13 on page 4-29. The values RM[0] and IM[O] are not 


stored because they are not used to compute the final outputs in phase 
4. 
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Figure 4-13. Phase 4 Data Memory 
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Chapter 5 


Resource Management 


This chapter introduces features of the 54x that improve system performance. 
These features allow you to conserve power and manage memory. You can 
improve the performance of any application through efficient memory manage- 
ment. Some issues include: 


On-chip memory versus off-chip memory 


Random access variables that use direct memory addressing versus aggre- 
gate variables that include structures/arrays 


a 
Lj 


The use of pointers for accessing the arrays and pointers 
Alignment of long words to even addresses 
The K-boundary requirement for circular buffers 


UOUCcU 


Allocation of stack 


This chapter also discusses unique features of the 548 and ’LC548 that help 
when an application needs a large amount of memory. 
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5.1 Memory Allocation 
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The ’54x can access a large amount of program and data memory (64K words 
each), but can handle only a limited amount of on-chip memory. On-chip 
memory accesses reduce the cycle time, since there are eight different internal 
buses on the ’54x but there is only one external bus for off-chip accesses. This 
means that off-chip operation requires more cycles to perform an operation 
than on-chip operation. 


The DSP uses wait-state generators to interface to slower memories. The sys- 
tem, then, cannot run at full speed. If on-chip memory consists of dual access 
RAM (DARAM), accessing two operands from the same block does not incur 
a penalty. Using single access RAM (SARAM), however, incurs a cycle penal- 
ty. You can use on-chip ROM for tables to make efficient use of memory. 


Random-access variables use direct addressing mode. This allocates all the 
random variables on a single data page, using one data page initialization for 
the application. Data-page relative memory addressing makes efficient use of 
memory resources. Each data variable has an associated lifetime. When that 
lifecycle is over, the data variable ceases to exist. Thus, if two data variables 
have non-overlapping lifetimes, both can occupy the same physical memory. 
All random variables, then, can form unions in the linker command file. 


The actual lifetime of a variable determines whether it is retained across the 
application or only in the function. By careful organization of the code memory, 
resources can be used optimally. Aggregate variables, such as arrays and 
structures, are accessed via pointers located within that program’s data page. 
Aggregate variables reside elsewhere in memory. Depending upon the life- 
time of the arrays or structures, these can also form unions accordingly. 


Memory management is required for interrupt-driven tasks. Often, program- 
mers assume that all CPU resources are available when required. This may not 
be the case if tasks are interrupted periodically. These interrupts do not require 
many CPU resources, but they force the system to respond within a certain time. 
To ensure that interrupts occur within the specified time and the interrupted code 
resumes as soon as possible, you must use low overhead interrupts. If the ap- 
plication requires frequent interrupts, you can use some of the CPU resources 
for these interrupts. For example, when all CPU resources are used, simply sav- 
ing and restoring the CPU’s contents increases the overhead for an interrupt 
service routine (ISR). 


A dedicated auxiliary register is useful for servicing interrupts. Allowing inter- 
rupts at certain places in the code permits the various tasks of an application 
to reuse memory. If the code is fully interruptible (that is, interrupts can occur 
anywhere and interrupt response time is assured within a certain period), 
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memory blocks must be kept separate from each other. On the other hand, if 
acontext switch occurs at the completion of a function rather than in the middle 
of execution, the variables can be overlapped for efficiency. This allows vari- 
ables to use the same memory addresses at different times. 


Long words must be aligned at even boundaries for double-precision opera- 
tions; thatis, the most significant word at an even address and the least signifi- 
cant word at an odd address. Circular buffers start at a K boundary, where K 
is the smallest integer that satisfies 2K > R and Ris the size of the circular buff- 
er. If an application uses circular buffers of different sizes, you must use the 
align directive to align the buffers to correct sizes. You can do this by allocating 
the largest buffer size as the first alignment, the next highest as the second 
alignment, and so forth. Example 5-1 shows the memory management align- 
ment feature where the largest circular buffer is 1024 words, and therefore, is 
assigned first. A 256-word buffer is assigned next. Unused memory can be 
used for other functions without conflict. 


Example 5—1. Memory Management 
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DRAM : origin = 0x0100, length = 0x1300 
inpt_buf : {} > DRAM,align(1024)PAGE 1 
outdata : {} > DRAM,align(1024) PAGE 1 
UNION : > DRAM align(1024) PAGE 1 
{ 

fft_bffr 

adpt_sct: 


{ 

* (bufferw) 

.+=80h; 

* (bufferp) 

} 
} 
UNION ¢ > DRAM align(256) PAGE 1 
{ 


fir _bfr 
Gir br 
Cort air 
bufferh 
twid_sin 


} 


UNION 7 > DRAM align(256) PAGE 1 
{ 
fir scott 
Cir _bfrl 
bufferx 
twid_cos 
} 
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Stack allocation can also benefit from efficient memory management. The 
stack grows from high to low memory addresses. The stack pointer (SP) 
decrements the stack by 1 before pushing its contents, which must be 
preserved, onto the stack and post increments after a pop. The bottom location 
is added to the stack, giving the actual stack size. The last element is always 
empty. Whether the stack is on chip or off chip affects the cycle count for acces- 
sing data. 


Example 5—2 shows stack initialization when the application is written in 
assembly. The variable SYSTEM_STACK holds the size of the stack. It is 
loaded into the SP, which points to the end of the stack. The predecrement 
during the push operation and the postincrement during the pop cannot over- 
flow the stack. Example 5—3 shows stack initialization when the application is 
written in C. 


Example 5—2. Stack Initialization for Assembly Applications 


K_STACK_SIZE -set 100 
STACK -usect “stack”, K_STACK_ SIZE 
SYSTEM_STACK .set STACK+K_STACK_STZE 
.ref SYSTEM_STACK 
STM #SYSTEM_STACK, SP ; initialization 
; of SP- this is done 
, vectors.asm 
stack : {} DRAM PAGE 1 y; initialization of stack 
ti 


in linker command file 


The compiler uses a stack to allocate local variables, pass arguments, and 
save the processor status. The stack size is set by the linker and the default 
size is 1 K words. In Example 5-3, the .stack section creates a stack size of 
1 K words. A section of 100 words is created, referenced as top_stck and 
btm_stck, for the CPU requirements. The rest of the stack (1024 — 100) words 
can be used for passing arguments and local variables. Only the btm_stck is 
referenced in the code; hence, several sections can be created within the 1 K 
words of the stack. 


Example 5—3. Stack Initialization for C Applications 
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.ref btm_stck ; bottom of stack label 
STM #btm_stck, SP ; initialization of SP - this is done 
; vectors.asm 
-stack : 
{ 
top_stck =.; /* top of stack */ 
.+=100; /* size of the stack */ 
btm_stck =.; /* this is done in linker command 
} file */ 
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5.2 Overlay Management 
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Some systems use a memory configuration in which all or part if the memory 
space is overlaid. This allows the system to map different banks of physical 
memory into and out of a single address range. Multiple banks of physical 
memory can overlay each other at one address range. This is achieved by 
setting the OVLY bit in the PMST register. This is particularly useful in loading 
the coefficients of a filter, since program and data use the same physical 
memory. 


If an application needs more than 64K words of either data or program memory, 
two options are available. The first extends the 16-bit address line to a 
16 + n-address line for the extended memory space. The ’548 provides 16 + 7 
address lines to access 8M words of program space. The other option uses 
an external device that provides upper addresses beyond the 16-bit memory 
range. The DSP writes a value to a register located in its I/O space, whose data 
lines are the higher address bits. It implements bank switching to cross the 64K 
boundary. Since the bank switch requires action from the DSP, frequent 
switching between the banks is not very efficient. It is more efficient to partition 
tasks within a bank and switch banks only when starting new tasks. 


The ’LC548 is designed to support a much larger program space of 8M words. 
lts memory-mapped register controls paging, and its extra instructions 
address extended program space. The OVLY bit configures the 8M words for 
on- or off-chip memory. If OVLY = 1, the lower half (128 pages) is a shared, 
on-chip, 32K-word block and the remaining 4M words are off-chip. If OVLY = 0, 
the entire 8M words of memory are off chip. 
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5.3 Memory-to-Memory Moves 


There are various reasons for performing memory-to-memory moves. These 
reasons include making copies of buffers to preserve the original, moving con- 
tents from ROM to RAM, and moving copies of code from their load location 
to their execution location. Example 5—4 implements memory-to-memory 
moves on the ’54x using single-instruction repeat loops. 


Example 5—4.Memory-to-Memory Block Moves Using the RPT Instruction 


OE Ee TORE CEC EE PS ES SEE Te Ee EEE ET TEE EE Ee Se a a Pe eS 
; This routine uses the MVDD instruction to move 
; information in data memory to other data memory 
; locations. 
(A a A A 0 0 2200 
, 
MOVE_DD: 
STM #4000h, AR2 ;Load pointer to source in 
;data memory. 


STM #100h, AR3 ;Load pointer to 
;destination in data memory. 

RPT # (1024-1) ;Move 1024 value. 

MVDD *AR2+, *AR3+ 

RE 


OP OPS PP EEE EE EE PE OE EP PPPS ES EP EE ES ES PE PPE TE CPE PES CPE PP PP TF 
; This routine uses the MVDP instruction to move external 
; data memory to internal program memory. 
TUTTO TT TOTO TOTO TET ETT TEE OTE OPT 
r 
MOVE_DP: 

STM #0E0O00h,AR1 ;Load pointer to source in 

,;data memory. 


RPT # (8192-1) ;Move 8K to program memory space. 
MVDP *AR1+,#800h 
RET 
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Example 5-4. Memory-to-Memory Block Moves Using the RPT Instruction (Continued) 
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FES EEE EE OS ES PS EEE TPP EEE EE SP PE CE CP EEE EE EP PE EET EP ET RE 
; This routine uses the MVPD instruction to move external 
; program memory to internal data memory. 


LOL OE Pee PE Ee PE ek PP Pe ee LE ee Se OSL SEP PE Ee ee ee ee Se 


Lf 


MOVE_PD: 
STM #0100h, AR1L ;Load pointer to 
;destination in data memory. 
RPT # (128-1) ;Move 128 words from external 
MVPD #3800h, *AR1+ ;program to internal data 
;memory. 
RET 


i a ae A de ie es ee oe de a eee ee a ay ae ee es ee ee ee ee a a ae ee et Se ee ee 
; This routine uses the READA instruction to move external 
; program memory to internal data memory. This differs 

; from the MVPD instruction in that the accumulator 

; contains the address in program memory from which to 

; transfer. This allows for a calculated, rather than 

; pre-determined, location in program memory to be 

; specified. 


| Ae Ae fee ee ae ee A A A A A A A A A A A A A A A A A A A 


, 


READ_A: 
STM #0100h, AR1 ;Load pointer to 
;destination in data memory. 
RPT # (128-1) ;Move 128 words from external 
READA *ARI+ ;program to internal data 
;memory. 
RET 


FETE LEK ERS CE CL TPE EE FP PES EE OL PPE Ee EP EE OS CT TS EOE OT EP ee 
; This routine uses the WRITEA instruction to move data 

; memory to program memory. The calling routine must 

; contain the destination program memory address in the 

7 accumulator. 


| Ae fe fae fe fe ee fe A A A A A A A A A A A A A A A A A A A 


, 


WRITE_A: 
STM #380h, AR1L ;Load pointer to source in 
;data memory. 
RPT # (128-1) ;Move 128 words from data 
WRITA *ARI+ ;memory to program memory. 
RET 
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5.4 Power Management 
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The ’54x family of DSPs exhibits very low power dissipation and flexible power 
management. This is important in developing applications for portable sys- 
tems, particularly wireless systems. Three aspects of power management are 
discussed here: on- versus off-chip memory, the use of HOLD, and the use of 
IDLE modes. 


To fetch and execute instructions from on-chip memory requires less power 
than accessing them from off-chip memory. The difference between these two 
accesses becomes noteworthy if a large piece of code resides off chip and is 
used more frequently than the on-chip code. The code can be partitioned so 
that the code that consumes the most power and is used most frequently is 
placed on-chip. (Masked ROM devices are another alternative for very high- 
performance applications.) 


If the program is executed from internal memory and no external access 
occurs, switching of address outputs can be disabled with the AVIS bit in the 
PMST register. This feature saves a significant amount of power. However, 
once the AVIS bit is set, the address bus is still driven in its previous state. The 
external bus function in the bank-switching control register (BSCR) contrib- 
utes to the state of the address, control, and data lines. If it is disabled, the ad- 
dress and data buses, along with the control lines, become inactive after the 
current bus cycle. 


The HOLD signal and the HM bit initiate a power-down mode by either shutting 
off CPU execution or continuing internal CPU execution if external access is 
not necessary. This makes external memory available for other processors. 
The timers and serial ports are not used, and the device can be interrupted and 
serviced. 


Using the IDLE1, IDLE2, and IDLE3 modes dissipates less power than normal 
operation. The system clock is not halted in IDLE1, but CPU activities are 
stopped. Peripherals and timers can bring the device out of power-down 
mode. The system can use the timer interrupt as a wake-up if the device needs 
to be in power-down mode periodically. The IDLE2 instruction halts both CPU 
and peripherals. Unlike the IDLE1 mode, an external interrupt wakes up the 
processor in IDLE2. The IDLE2 mode saves a significant amount of power, 
compared to IDLE1. The IDLE3 mode shuts off the internal clock, also saving 
power. 
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Arithmetic and Logical Operations 


This chapter shows how the ’54x supports typical arithmetic and logical opera- 
tions, including multiplication, addition, division, square roots, and extended- 


precision operations. 
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The 54x implements division operations by using repeated conditional subtrac- 
tion. Example 6—1 uses four types of integer division and modulus: 


Type |: 32-bit by 16-bit unsigned integer division and modulus 
Type Il: 32-bit by 16-bit signed integer division and modulus 
Type Ill: 16-bit by 16-bit unsigned integer division and modulus 
Type IV: 16-bit by 16-bit signed integer division and modulus 


SUBC performs binary division like long division. For 16-bit by 16-bit integer 
division, the dividend is stored in low part accumulator A. The program repeats 
the SUBC command 16 times to produce a 16-bit quotient in low part accumu- 
lator A and a 16-bit remainder in high part accumulator B. For each SUBC sub- 
traction that results in a negative answer, you must left-shift the accumulator 
by 1 bit. This corresponds to putting a 0 in the quotient when the divisor does 
not go into the dividend. For each subtraction that produces a positive answer, 
you must left shift the difference in the ALU output by 1 bit, add 1, and store 
the result in accumulator A. This corresponds to putting a 1 in the quotient 
when the divisor goes into the dividend. 


Similarly, 32-bit by 16-bit integer division is implemented using two stages of 
16-bit by 16-bit integer division. The first stage takes the upper 16 bits of the 
32-bit dividend and the 16-bit divisor as inputs. The resulting quotient 
becomes the higher 16 bits of the final quotient. The remainder is left shifted 
by 16 bits and adds the lower 16 bits of the original dividend. This sum and the 
16-bit divisor become inputs to the second stage. The lower 16 bits of the 
resulting quotient is the final quotient and the resulting remainder is the final 
remainder. 


Both the dividend and divisor must be positive when using SUBC. The division 
algorithm computes the quotient as follows: 


1) The algorithm determines the sign of the quotient and stores this in 
accumulator B. 


2) The program determines the quotient of the absolute value of the numera- 
tor and the denominator, using repeated SUBC commands. 


3) The program takes the negative of the result of step 2, if appropriate, ac- 
cording to the value in accumulator B. 
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For unsigned division and modulus (types | and III), you must disable the sign 
extension mode (SXM = 0). For signed division and modulus (types II and IV), 
turn on sign extension mode (SXM = 1). The absolute value of the numerator 
must be greater than the absolute value of the denominator. 


Example 6—1. Unsigned/Signed Integer Division Examples 


7; File Name: DIV_ASM.ASM 
7; Title: Divide & Modulus - Assembly Math Utilities. 
7; Original draft: Alex Tessaralo 


7; Modified for ’54x: Simon Lau & Philip Jones 
7; Texas Instruments Inc. 


vreoege 


7; Target: C54xX 

rroane 

ad 

7; Contents: DivModUI32 ; 32-bit By 16-bit Unsigned Integer Divide 
a ; And Modulus. 

rae DivModUI16 ; 16-bit By 16-bit Unsigned Integer Divide 
ae ; And Modulus. 

ae DivModI32 ; 32-bit By 16-bit Signed Integer Divide 
ae ; And Modulus. 

eG DivModI16 ; 16-bit By 16-bit Signed Integer Divide 
. ; And Modulus. 

rhoane 

ad 

7; History: mm/dd/yy | Who | Description Of Changes. 

ii + tH = a = 

7; 08/01/96 | Simon L. | Original draft. 


tr ad 
7; Module Name: DivModUI32 


| oe Aa 


7; Description: 32 Bit By 16 Bit Unsigned Integer Divide And Modulus 


vroewe nT 


7; Usage ASM: 


77 ~.bss d_NumH, 1 ; 00000000h to FFFFFFFFh 
pe «bss d_NumL, 1 

77 .bss d_Den,1 ; 0000h to FFFFh 

te. abss d_QuotH,1 ; 0O0000000h to FFFFFFFFh 
ee. SS d_QuotL,1 

77 ~.bss d_Rem, 1 ; 0O000h to FFFFh 

te 

77~6CALL DivModUI32 

TS a Pee Te eS a ee rad 
77 Input d_NumH 

ae d_NumL 

a d_Den 
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tr 
77 Modifies: 


Pe Output: 


Unsigned/Signed Integer Division Examples (Continued) 


SXM 


77 accumulator A 


d_QuotH 
d_QuotL 
d_Rem 
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77 Algorithm: Quot = Num/Den 
Hes Rem = Num%Den 
pipes NumH = n3|n2 QuotH = q3]|q2 
a NumL = n1|no QuotL = ql|q0 
3 Den = da1|do Rem = r1|ro 
av 
Hee Phasel: t1|to|q3|q2 =A (after repeating SUBC 16 times) 
vr ————————————e 
i di|do ) 00|00|n3|n2 =a (before) 
rr 
tr 
He Phase2: r1|r0|ql|qo =A (after repeating SUBC 16 times) 
a ————— 
i dijdo ) tijtO|n1jno =a (before) 
ef 
of NOTES: Sign extension mode must be turned off. 
rrore 
.def DivModUI32 
.ref d_NumH 
.ref d_NumL 
.ref d_Den 
.ref d_QuotH 
.ref d_QuotL 
.ref d_Rem 
.textDivModUI32: 
RSBX SXM ; Sign extention mode off 
LD d_NumH,A 
RPT # (16-1) 
SUBC d_Den,A 
STLA, d_QuotH 
XOR d_QuotH,A ; clear AL 
OR d_NumL,A ; AL = Numb 
RPT # (16-1) 
SUBC d_Den,A 
STLA, d_QuotL 
STHA, d_Rem 
RET 
tr tr 
7; Module Name: DivModUI16 


vroegr 


7; Description: 


7; Usage ASM: 
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16 Bit By 16 Bit Unsigned Integer Divide And Modulus 
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Example 6-1. Unsigned/Signed Integer Division Examples (Continued) 
ae .oss d_Num, 1 ; 0O000h to FFFFh 
es .bss d_Den,1 ; 0000h to FFFFh 
a .bss d_Quot,1 7 0000h to FFFFh 
a .bss d_Rem, 1 ; 0000h to FFFEh 
rae 
Has CALL DivModUI16 
i SRR (ca aan eta ea a a a a a a a a ee a a a 
77 Input d_Num 
mr d_Den 
ir 
7; Modifies: SXM 
a accumulator A 
ir 
77 Output: d_Quot 
vr d_Rem 
Fat aa 2) aml a ea ak lama A a SE a 
7; Algorithm: Quot = Num/Den 
7 Rem= Num%SDen 
tr 
nr Num= n1|no Quot = q1|qo0 
ii Den= d1|do Rem = r1|/r0 
tie 
ii r1|r0|ql|q0 =A (after repeating SUBC 16 times) 
ver a, 
or di|do ) 00|00|n1|no0 =a (before) 
ver 
ret NOTES: Sign extension mode must be turned off. 
ti r vr 
.def DivModUI16 
.ref d_Num 
.ref d_Den 
.ref d_Quot 
.ref d_Rem 
-text 
DivModUI16: 
RSBX SXM 7 Sign extention mode off 
LD @d_Num, A 
RPT # (16-1) 
SUBC @d_Den,A 
STL A, @d_Quot 
STH A, @d_Rem 
RET 
r 
77, Module Name: DivModI32 


vrowe 


7; Description: 32 Bit By 16 Bit Signed Integer 


Divide And Modulus. 


ie dae ae 


7; Usage ASM: 


a .bss d_NumH, 1 * 
a .bss d_NumL, 1 

ae .bss d_Den,1 . 
s - bss da. OuotH, 1 : 


PRELIMINARY 


80000001h to 7FFFFFFFh 


8000h to 7FFFh 
80000001h to 7FFFFFFFh 
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Example 6-1. Unsigned/Signed Integer Division Examples (Continued) 


ae .bss d_QuotL,1 
_ .bss d_Rem,1 ; 8000h to 7FFFh 
tr 
7s CALL DivMod1I32 
roar aan Seo = ead es te ar ome ge TT aa ae A 
7; Input: d_NumH 
a d_NumL 
a d_Den 
rr 
7; Modifies: SXM 
ii T 
ii accumulator A 
Cir accumulator B 
rr 
77 Output: d_QuotH 
He d_QuotL 
ii d_Rem 
rrore = = ae = Se - = —= <= na 
7; Algorithm: Quot = Num/Den 
3 Rem = Num%sDen 
Hess ie Signed division is similar to unsigned division except that 
ae the sign of Num and Den must be taken into account. 
oe First the sign is determined by multiplying Num by Den. 
3 Then division is performed on the absolute values. 
vr 
i NumH = n3|n2 QuotH = q3|q2 
Hae NumL = nl}no QuotL = ql|q0 
Me Den = dl1}do Rem = r1}r0 
av 
Be Phasel: t1|to q3|q2 =A (after repeating SUBC 16 times) 
a ——————— SSE 
i di|do ) 00|00|n3|n2 =a (before) 
tr 
rr 
ra Phase2: r1|ro ql|qoO =A (after repeating SUBC 16 times) 
rv — a a, 
i di]do ) ti|tO|n1|n0 =a (before) 
vr 
ae NOTES: Sign extension must be turned on. 
rrore 
.def DivModI32 
.cef d_NumH 
.ref d_NumL 
.ref d_Den 
.ref d_QuotH 
.cef d_QuotL 
.cef d_Rem 
stext 
DivModI32: 
SSBX SXM 7 Sign extention mode on 
LD d_Den,16,A 
MPYA d_NumH , B has sign of quotient 
ABS A 
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Example 6-1. Unsigned/Signed Integer Division Examples (Continued) 
STHA ,d_Rem ; G_Rem = abs(Den) temporarily 
LD d_NumH,16,A 
ADDS d_NumL,A 
ABS A 
STH A, d_QuotH , A_QuotH = abs (NumH) temporarily 
STL A,d_QuotL , A_QuotL = abs (NumL) temporarily 
LD d_QuotH,A 
RPT # (16-1) 
SUBC d_Rem,A 
STL A, d_QuotH ; AH = abs (QuotH) 
XOR d_QuotH,A ; clear AL 
OR d_QuotL,A ; AL = abs (NumL) 
RPT # (16-1) 
SUBC d_Rem,A 
STL A,d_QuotL ; AL = abs (QuotL) 
STH A,d_Rem ; AH = Rem 
BCD DivModI32Skip, BGEQ , if B neg, then Quot = 
; ~abs (Quot) 
LD d_QuotH,16,A 
ADDS d_QuotL,A 
NEG A 
STH A,d_QuotH 
STL A,d_QuotL 
DivModI32Skip: 

RET 

Module Name: DivModI16 


ae Ae A 


7; Description: 16 Bit By 16 Bit Signed Integer Divide And Modulus. 

trop So ee Sr op Sg Pe et een ee Tee eg ee ee Sere po et we eg a 
7; Usage ASM: 

ae -bss d_Num, 1 ; 8000h to 7FFFh (Q0.15 format) 

ee -bss d_Den,1 ; 8000h to 7FFFh (Q0.15 format) 

re -bss d_Quot,1 ; 8000h to 7FFFh (Q0.15 format) 

ae -bss d_Rem, 1 ; 8000h to 7FFFh (Q0.15 format) 

ver 

re CALL DivModI16 

rreore —— a ae ee —— eee a oi = aa - Sa i? 
77 Input: d_Num 

re d_Den 

tr 

7; Modifies: AR2 

ii T 

rae accumulator A 

a accumulator B 

ii SXM 

tr 

77 Output d_Quot 

ii d_Rem 

Vl HOU ee ee ee ee Wa Ie ee Be Oy ee Pr Oe i? 
7; Algorithm: Quot = Num/Den 

3; Rem = Num%sDen 
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Example 6—1. Unsigned/Signed Integer Division Examples (Continued) 


oi Signed division is similar to unsigned division except that 
Hae the sign of Num and Den must be taken into account. 
He First the sign is determined by multiplying Num by Den. 
ree Then division is performed on the absolute values. 
vv 
ee Num = n1|no Quot = ql|q0d 
Wes Den = d1|do Rem = r1}ro 
vv 
ii r1|r0|q1|qo =A (after repeating SUBC 16 times) 
v x ——_—— a, 
A d1]dO ) o00]00|n1JnO0 =A (before) 
Ff 
Hae NOTES: Sign extension mode must be turned on. 
tr 
rr 
.def DivModI16 
.ref d_Num 
ref d_Den 
ref d_Quot 
ref d_Rem 
text 
DivModI16: 
SSBX SXM 7 Sign extention mode on 
STM #d_Quot, AR2 
LD d_Den,16,A 
MPYA d_Num ; B has sign of quotient 
ABS A 
STH A,d_Rem ; GA_Rem = abs(Den) temporarily 
LD d_Num,A 
ABS A ; AL = abs (Num) 
RPT # (16-1) SUBC d_Rem,A 
STL A, d_Quot ; AL = abs (Quot) 
STH A,d_Rem ; AH = Rem 
LD #0,A 
SUB d_Quot,16,A ; AH = -abs (Quot) 
SACCD A, *AR2, BLT ; If B neg, Quot = ~—abs (Quot) 
RET 


mr ta 


;;End Of File. 
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6.2 Sines and Cosines 


Sine-wave generators are used in signal processing systems, such as com- 
munications, instrumentation, and control. In general, there are two methods 
to generate sine and cosine waves. The firstis the table look-up method, which 
is used for applications not requiring extreme accuracy. This method uses 
large tables for precision and accuracy and requires more memory. The se- 
cond method is the Taylor series expansion, which is more efficient. This meth- 
od determines the sine and cosine of an angle more accurately and uses less 
memory than table look-up, and it is discussed here. 


The first four terms of the expansion compute the angle. The Taylor series ex- 
pansions for the sine and cosine of an angle are: 


. = xs. x? x’, x9 
a ey a Yo 


ag XP XO XE fg XE 
ai) Sl s(1 35) 


= x4 (1-25 (1-25)) 

= 8 (1-28 (1-25(1-35))) 

= (1-35 ('-6(1-25(1-88)}) 
4X 4 xt xé 4% 


21° 4! 6! 8! 
pee oe * (1-25) 


cos(@) = 


217 41 6l\ 7.8 
= 4 XE Xt Gxt fy XP 
: e+ O(1 ae" 5) 
_X?f 4 x2 (4 x2 (4 Xx? 
! e (1 (1 zal" +5))) 


The following recursive formulas generate the sine and cosine waves: 


sin nO = 2 cos(@)sin{(n—1 )6} — sin{(n-2)6} 
cos n@ = 2 cos(@)cos{(n—1)6} — cos{(n-2)6} 


These equations use two steps to generate a sine or cosine wave. The first 
evaluates cos(8) and the second generates the signal itself, using one multiply 
and one subtract for a repeat counter, n. 
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Example 6—2 and Example 6-3 assumes that the delayed cos((n—1)) and 
cos((n—2)) are precalculated and are stored in memory. The Taylor series 
expansion to evaluate the delayed cos((n—1)), cos((n—2))/sin((n—1)), and 
sin((n—2)) values for a given 8 can also be used. 


Example 6—2. Generation of a Sine Wave 


; Functional Description 

; This function evaluates the sine of an angle using the Taylor series 
; expansion. 

; sin(theta) = x(1-x*2/2*3 (1-x*2/4*5 (1-x*2/6*7 (1-x*2/8*9)))) 


-mmregs 
.def d_x,d_squr_x,d_coff,d_sinx,C_l 
d_coff -sect “coeff” 
.-word Olc7h 
-word 030bh 
-word 0666h 
.word 1556h 
d_x -usect "sin_vars”,1 
d_squr_x .usect “"sin_vars”,1 
d_temp -usect "sin_vars”,1 
d_sinx -usect "sin_vars”,1 
Ca -usect "sin_vars”,1 
text 
sin_start: 
STM #d_coff,AR3 yp c1=1/72, c2=1/42,c3=1/20, 
; c4=1/6 
STM #d_x,AR2 ; input value 
STM #C_1,AR4 ; Al, A2, A3, A4 
sin_angle: 
LD #d_x, DP 
ST #6487h, d_x 7; pi/4 
ST #7£f£f£h,C_1 
SQUR *AR2+,A , let x*2 =P 
ST A, *AR2 ; AR2 - > x%2 
|| LD *AR4,B ; 
MASR *AR2+,*AR3+,B,A 3; (1-x%2) /72 
MPYA A ; 1l-x*2 (1-x*2) /72 
; T = x*2 
STH A, *AR2 
MASR *AR2—,*AR3+,B,A ; A = 1-x%*2/42 (1-x%*2/72) 
; T =x*2 (1-x*2/72) 
MPYA *AR2+ ; B = A(32-16) *x*2 
ST B, *AR2 ; 
LD *AR4,B : B= Cot 
MASR *AR2—, *AR3+,B,A 7 A = 1-x*2/20 (1-x*2/42 (1-x*2/72) 
MPYA *AR2+ ; B = A(32-16) *x“*2 
ST B, *AR2 
LD *AR4,B 
MASR *AR2—,*AR3+,B,A , AR2 - > d_squr_x 
MPYA d_x 
STH B, dsinx ; sin(theta) 
RET 
end 
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Example 6-2. Generation of a Sine Wave (Continued) 


; Functional Description 

; This function generates the sine of angle. Using the recursive given above, the 

7 cosine of the angle is found and the recursive formula is used to generate the 
; Sine wave. The sin(n-1) and sin(n-2) can be calculated using the Taylor 

; series expansion or can be pre-calculated. 


-mmregs 
.ref cos_prog,cos_start 
d_sin_delayl -usect "cos_vars”,1 
d_sin_delay2 -usect "cos_vars”,1 
K_sin_delay_1 .set OA57Eh ; sin(-pi/4) 
K_sin_delay_2 .set 8000h ; Sin(-2*pi/4); 
K_2 .set 2h ; cicular buffer size 
K_256 .set 256 ; counter 
K_THETA .set 6487h 7; pi/4 
~text 
start: 
LD #d_sin_delayl, DP 
CALL cos_ start 
STM #d_sin_delayl,AR3 ; intialize the buffer 
RPTZ A,#3h 
STL A, *AR3+ 
STM #1, ARO 
STM #K_2, BK 
STM #K_256-1,BRC 
STM #d_sin_delayl,AR3 
ST #K_sin_delay_1, *AR3+% ; load calculated initial values of sin((n-1) ) 
ST #K_Sin_delay_2, *AR3+% ; load calculated initial values of sin((n-2) ) 


, this generates the sine_wave 
sin_generate: 


RPTB end_of_sine 
MPY *AR2, *AR3+0%,A ; cos(theta) *sin{ (n-1)theta} 
SUB *AR3,15,A ; 1/2*sin{ (n-2) theta) 
SFTA A,1,A ; sin(n*theta) 
STH A, *AR3 ; store 
end_of_sine 
NOP 
NOP 
B sin_generate 
end 
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Example 6-3. Generation of a Cosine Wave 


; Functional Description 

; this computes the cosine of an angle using the Taylor Series Expansion 
-mmregs 
.def d_x,d_squr_x,d_coff,d_cosx,C_7FFF 
.def cos_prog,cos_start 


STH A, *AR3 ; store 
-word O024ah Paes Ai ees) 
.word 0444h = I/5:/6 
-word Oaa9h ; 1/3.4 
d_x -usect "“cos_vars”,1 
d_squr_x -usect "cos_vars”,1 
d_cosx -usect "cos_vars”,1 
C_7FFF -usect "cos_vars”,1 
K_THETA .set 6487h ; pi/s4 
K_7FFF -set 7FFFH 
-Cext 
cos _ start: 
STM #d_coff, AR3 3c01=1/56, c2=1/30, c3=1/12 
STM #d_x, AR2 ; input theta 
STM #C_7FFF,AR4 ; Al, A2, A3, A4 
cos_prog: 
LD #d_x,DP 
ST #K_THETA, d_x ; input theta 
ST #K_7FFF, C_7FFF 
SQUR *AR2+,A ; let x*2 =P 
ST A, *AR2 ; AR2 - > x2 
|| Lp *AR4,B ; 
MASR *AR2+, *AR3+,B,A 3; (1-x%2)/72 
MPYA A 3 1-x*2 (1-x*2) /72 
0 me ee? 
STH A, *AR2 
MASR *AR2-, *AR3+,B,A ; A = 1-x%*2/42 (1-x%*2/72) 
3 T =x*2 (1-x%*2/72) 
MPYA *AR2Q+ ; B = A(32-16)*x%*2 
ST B, *AR2 ; 
|| Lp *AR4,B ;B=C_l 
MASR *AR2-, *AR3+,B,A ; A = 1-x*2/20 (1-x*2/42 (1-x*2/72) ) 
SFTA A,-1,A 3-1/2 
NEG A 
MPYA *AR2+ ; B = A(32-16) *x%*2 
RETD 
ADD *AR4,16,B 
STH B, *AR2 7; cos (theta) 
end 
mmregs 
.ref cos_prog, cos_start 
d_cos_delayl .usect "cos_vars”,1 
d_cos_delay2 .usect "cos_vars”,1 
d_theta -usect "cos_vars”,1 
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Example 6-3. Generation of a Cosine Wave (Continued) 


K_cos_delay_1 
K_cos_delay_2 
K_2 

K_256 

K_theta 


start: 
LD 
CALL 
CALL 
STM 
RPTZ 
STL 
STM 
ST 
ST 
STM 
ST 
STM 
STM 
STM 

cos_generate: 
RPTB 
MPY 
SUB 
SFTA 
STH 
PORTW 

end_of_cose 
NOP 
NOP 


PRELIMINARY 


-set 06ed9h 
.set 4000h 
.set 2h 
.set 256 
.set 4303h 
-text 


d_cos_delay1l,DP 
cos_start 
cos_prog 
d_cos_delay1,AR3 
A, #3h 
A, *AR3+ 
#d_cos_delayl,AR3 
K_cos_delay_1, *AR3+ 
K_cos_delay_2, *AR3 
#d_cos_delayl,AR3 
#K_ theta,d_theta 

#1, ARO 

K_2,BK 

K_256-1,BRC 


end_of_cose 
*AR2, *AR3+03,A 
*AR3,15,A 
A,1,A 

A, *AR3 
*AR3,56h 


cos_generate 


; cos (-pi/6) 

7; cos(-2*pi/6); 

; Cicular buffer size 

; counter 

; sin(pi/2-pi/6)= cos (pi/6) 
; cos (pi/2-pi/x) 

; .052= 4303h 


; calculate cos (theta) 


, output vaues 


; cos(theta) *cos{ (n-1)theta} 
; 1/2*cos{ (n-2) theta) 

7 cos (n*theta) 

; store 
; write to a port 


; next sample 
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6.3 Square Roots 


Example 6—4 uses a 6-term Taylor series expansion to approximate the square 
root of a single-precision 32-bit number. A normalized, 32-bit, left-justified num- 
ber is passed to the square root function. The output is stored in the upper half 
of the accumulator, and the EXP and NORM instructions normalize the input 
value. The EXP instruction computes an exponent value in a single cycle and 
stores the result in T, allowing the NORM instruction to normalize the number 
in a single cycle. If the exponent is an odd power, the mantissa is (multiplied by 
1 divided by the square root of 2) to compensate after finding the square root 
of the 32-bit number. The exponent value is negated to denormalize the number. 


ys = (i x 
where : 


X= y-l 


X i KO. DK LO 
2 8°16 128 256 


= xX _ X\2 X\3 _ X\4 X\5 
1+ 0.5(%) + 0.5(%) 0.626(%) + 0.875(3) 


=1+ 


where : 


05sx<1 


Example 6—4. Square Root Computation 


KKKK KK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KK KKK KKK KK KK 


* Six term Taylor Series is used here to compute the square root of a number 

* y*0.5 = (1+x)*0.5 where x = y-l 

* = 14+(x/2)-0.5* ( (x/2) *2+0.5* ( (x/2) *3-0.625* ( (x/2) *4+0.875* ( (x/2) *5) 

* 0.5 <= x<il 

KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKKKKK KKK 


-mmregs 
-sect "squr_var” 
d_part_prod -word 0 
d_part_shift -word 0 
C_8000 -word 0 
C_sqrt_one_half .word 0 
d_625 -word 0 
d_875 -word 0 
tmp_rgl -word 0 
K_input -set 800h ; input # = 0.0625 
K_8000 -set 8000h ; =1 or round off bit 
K_4000 -set 4000h ; 0.5 coeff 
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Example 6-4. Square Root Computation (Continued) 


K_SORT_HALF .set 5a82h ; 1/sqrt2 
K_625 -set -20480 ; coeff 0.625 
K_875 -set 28672 ; coeff 0.875 
-text 
sqroot: 
LD #d_part_prod,DP 
ST #K_8000,C_8000 
ST #K_input, d_part_prod 
ST #K_SORT_HALF, C_sqrt_one_half 
ST #K_875,d0_875 
ST #K_625,d_625 
LD d_part_prod,16,A ; load the # 
EXP A 
nop ; dead cycle 
NORM A 
ADDS C_8000,A ; round off bit 
STH A, d_part_prod ; normalized input 
LDM T,B 
SFTA B,-1,B ; check for odd or even power 
BCD res_even,NC 
NE B ; negate the power 
STL B,d_part_shift ; this shift is used to denormalize the # 
LD d_part_prod,16,B ; load the normalized input # 
CALLD sq_root ; square root program 
ABS B 
NOP ; cycle for delayed slot 
LD B,A H 
BD res_common 
SUB B,B 7; zero B 
MACAR C_sqrt_one_half,B ; square root of 1/2 


; odd power 
res_even 


LD d_part_prod,16,B 

CALLD sq_root 

ABSB 

NOP 7 cycle for the delayed slot 
res_common 

LD d_part_shift,T ; vight shift value 

RETD 

STH B,d_part_prod 

LD d_part_prod,TS,A ; denormaliize the # 
sq_root: 

SFTA B,-1,B ; x/2 = y-1/2 

SUB #K_4000,16,B,B 

STH B,tmp_rgl ; tmp_rgl = x/2 

SUB #K_8000,16,B ; B= 1+x/2 

SQUR tmp_rgl1,A ; A (x/2)%2, T = x/2 

NEG A ; A= -A 

ADD A,-1,B ; B= 1+x/2-.5(x/2)%*2 
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Example 6—4. Square Root Computation (Continued) 


6-16 


SQUR 
MACA 


A,A 
d_625,B 


tmp_rg1,T 
A 

d_875,B 
tmp_rgl1,A 
A 


A,-1,B 
C_8000,B 


r 


Ne Ne Ne Ne Ne Ne Ne 


A = (x/2)%4 
0.625*A+B 

T =0.625 

T = x/2 
(x/2)*4*x/2 
0.875*A+B 
x/2°2; T = x/2 
A = x/2*x/2°2 


round off bit 
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6.4 Extended-Precision Arithmetic 


PRELIMINARY 


Numerical analysis, floating-point computations, and other operations may 
require arithmetic operations with more than 32 bits of precision. Since the 54x 
devices are 16/32-bit fixed-point processors, software is required for arithmetic 
operations with extended precision. These arithmetic functions are performed 
in parts, similar to the way in which longhand arithmetic is done. 


The ’54x has several features that help make extended-precision calculations 
more efficient. One of the features is the carry bit, which is affected by most 
arithmetic ALU instructions, as well as the rotate and shift operations. The 
carry bit can also be explicitly modified by loading STO and by instructions that 
set or reset status register bits. For proper operation, the overflow mode bit 
should be reset (OVM = 0) to prevent the accumulator from being loaded with 
a saturation value. 


The two ’54x internal data buses, CB and DB, allow some instructions to han- 
dle 32-bit operands ina single cycle. The long-word load and double-precision 
add/subtract instructions use 32-bit operands and can efficiently implement 
multi-precision arithmetic operations. 


The hardware multiplier can multiply signed/unsigned numbers, as well as 
multiply two signed numbers and two unsigned numbers. This makes 32-bit 
multiplication efficient. 
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6.4.1 


Addition and Subtraction 


The carry bit, C, is setin STO if a carry is generated when an accumulator value 
is added to: 


Lj) The other accumulator 
Lj Adata-memory operand 
Lj An immediate operand 


A carry can also be generated when two data-memory operands are added 
or when a data-memory operand is added to an immediate operand. If a carry 
is not generated, the carry bit is cleared. 


The ADD instruction with a 16-bit shift is an exception because it only sets the 
carry bit. This allows the ALU to generate the appropriate carry when adding 
to the lower or upper half of the accumulator causes a carry. 


Figure 6-1 shows several 32-bit additions and their effect on the carry bit. 


Figure 6—1. 32-Bit Addition 


6-18 


C MSB LSB C MSB LSB 

X FFFFFF FF F F ACC X FFFFFF FF F F ACC 
+ 1 +F FE FF 

T -@ 0:10:10: 100-00 0-0 1 FFFFFFFFFE 

C MSB LSB C MSB LSB 

X OOT7F FF FF F F ACC xX 00 FF F F F F F ACC 
+ 1 +F F FF FFFFFE 

0 0080000000 1 OOT7FFFFFFE 

C MSB LSB C MSB LSB 

X FF 80000000 ACC X FF 8 0 00 0 ACC 
+ 1 +F FFF FF FFEFE 

0 FF8BO00000 01 1 FFTUVFFFFFFE 

ADDC 

C MSB LSB C MSB LSB 

1 000000000 0 ACC 1 FFFFFFFFF F ACC 
+ Q (ADDC) + Q_ (ADDC) 

0 0000000001 10000000000 

ADD Smem,16,src 

C MSB LSB C MSB LSB 

1 FF 8000F F F F ACC di FF 8000F F F F ACC 
+40 000010000 +40 O 7TFFFOOOO 

1 FFS8BOOI1FFFF 1 FFFFFFFFFE 
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Example 6-5 adds two 64-bit numbers to obtain a 64-bit result. The partial 
sum of the 64-bit addition is efficiently performed by the DLD and DADD 
instructions, which handle 32-bit operands in a single cycle. For the upper half 
of apartial sum, the ADDC (ADD with carry) instruction uses the carry bit gen- 
erated in the lower 32-bit partial sum. Each partial sum is stored in two memory 
locations by the DST (long-word store) instruction. 


Example 6-5. 64-Bit Addition 


PRELIMINARY 


PIES OEE TEE ETE PT TS ETE EE ETE TT Te EEE TO ST EES SP Te TE ETE 
; 64-bit Addition 
v 
; X3 X2 X1 XO 
; + Y3 Y2 Y1 YO 
¥ 
; W3 W2 W1 WO 
TEREEELEL EEE DOE LEEPELEL EEE CREE E DE DEEL ELE EEL EDR DED EELBELLE? 
ADD64: DLD @X1,A ;A = X1 x0 
DADD @Y1,A ;A = X1 XO + Y1 YO 
DST A,Q@W1 
DLD @X3,A ;A = X3 X2 
ADDC @Y2,A ;A = X38 X2 + 00 Y2+C 
ADD @y3,16,A ;A = X3 X2 + Y3 Y2+C 
DST A,Q@W3 
R 


Similar to addition, the carry bit is reset if a borrow is generated when an accu- 
mulator value is subtracted from: 


Lj The other accumulator 
_j A data-memory operand 
Lj An immediate operand 


A borrow can also be generated when two data-memory operands are sub- 
tracted or when an immediate operand is subtracted from a data-memory 
operand. If a borrow is not generated, the carry bit is set. 


The SUB instruction with a 16-bit shift is an exception because it only resets 
the carry bit. This allows the ALU to generate the appropriate carry when sub- 
tracting from the lower or the upper half of the accumulator causes a borrow. 


Figure 6—2 shows several 32-bit subtractions and their effect on the carry bit. 
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Figure 6—2. 32-Bit Subtraction 


C MSB LSB C MSB LSB 

X 00000000 0 0 ACC X F F 000000 0 0 ACC 
= 1 = 5 Be Bi es ES (Ee a Bee Ro 

O FFFFFFFEFFEFFEFE 0 0000000001 

C MSB LSB C MSB LSB 

x 007 F FF F F F F ACC xX 007 F F F F F F F ACC 
= 1 =F. Fe UE Ea 

1 OOTVFFFFFFE C FF80O0000 0 0 

C MSB LSB C MSB LSB 

X F F 800000 0 0 ACC X F F 800000 0 0 ACC 
= 1 =F. BB EEE 

il FFTV7VFFFFFFE 0 FF 0 01 

SUBB 

C MSB LSB C MSB LSB 

0 00000000 0 0 ACC O FFFFFF FFE F F Acc 
= QO (SUBB) = 0 (SUBB) 

0 | CoS clog A ci coe cs cel ca 1 BOE BO Be Be Pe Be BO abe BG 


C MSB LSB C MSB LSB 

1 FF8000F F F F ACC 0 FF 8000F F F F ACC 
“O00 00010000 =F_LE F FF F000 10 

0 OOTFFFEFFEFEFEE Oo FFS8BOOI1LFFEFFE 
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Example 6-6 subtracts two 64-bit numbers on the ’54x. The partial remainder 
of the 64-bit subtraction is efficiently performed by the DLD (long word load) 
and the DSUB (double precision subtract) instructions, which handle 32-bit 
operands in a single cycle. For the upper half of a partial remainder, the SUBB 
(SUB with borrow) instruction uses the borrow bit generated in the lower 32-bit 
partial remainder. Each partial remainder is stored in two consecutive memory 
locations by a DST. 


Example 6-6. 64-Bit Subtraction 


= Y¥3) ¥2. -Y1--Y0 


’ 
Ul 
’ 
’ 
; X3 X2 X1 XO 
’ 
’ 
’ 


W3 W2 W1 WO 


| ie fae ae a A A A A A A A A A A A A A A A A A A A A A A 


DLD @X3,A ;A = X3 X2 

SUBB @Y2,A ;A = X3 X2 - 00 Y2 - (inv C) 
DST A,@W1 

SUB @Y3,16,A ;A = X3 X2 - Y3 Y2 - (inv C) 
DST  A,@W3 

RE 


6.4.2 Multiplication 


The MPYU (unsigned multiply) and MACSU (signed/unsigned multiply and accu- 
mulate) instructions can also handle extended-precision calculations. 


Figure 6—3 shows how two 32-bit numbers obtain a 64-bit product. The MPYU 
instruction multiplies two unsigned 16-bit numbers and places the 32-bit result 
in one of the accumulators in a single cycle. The MACSU instruction multiplies 
a signed 16-bit number by an unsigned 16-bit number and accumulates the 
result in a single cycle. Efficiency is gained by generating partial products of 
the 16-bit portions of a 32-bit (or larger) value instead of having to split the val- 
ue into 15-bit (or smaller) parts. 
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Figure 6-3. 32-Bit Multiplication 


Unsigned multiplication 


X1 x YO 


Signed/unsigned multiplication 


X0 x Y1 


Signed/unsigned multiplication 


X1 x Y1 


+ Signed multiplication 


Final 64-bit result 


The program in Example 6—7 shows that a multiply of two 32-bit integer num- 
bers requires one multiply, three multiply/accumulates, and two shifts. The 
product is a 64-bit integer number. Note in particular, the use of MACSU, 
MPYU and LD instructions. The LD instruction can perform a right-shift in the 
accumulator by 16 bits in a single cycle. 


Example 6-8 performs fractional multiplication. The operands are in Q31 format, 
while the product is in Q30 format. 
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Example 6-7. 32-Bit Integer Multiplication 


rPrrrrrrrrrrrrrr reer 
This routine multiplies two 32-bit signed integers 
resulting; in a 64-bit product. The operands are fetched 
from data memory and the result is written back to data 


memory. 
Data Storage: 
X1,X0 32-bit operand 
Y1,Y0 32-bit operand 


W3,W2,W1, WO 
Entry Conditions: 
SXM = 1, OVM = 0 


64-bit product 


Prrrrrrvrrrrrrer reer 


STM #X0,AR2 ;AR2 = XO addr 

STM #Y0,AR3 ,;AR3 = YO addr 

LD *AR2,T 1T = XO 

MPYU *AR3+,A ;A = X0*YO0 

STL A, @WO ;save WO 

LD A,-16,A j;A =A >> 16 

MACSU *AR2+,*AR3-,A  j;A = XO*YO>>16 + XO¥*Y1 

MACSU *AR3+,*AR2,A 7A = X0O*YO>>16 + XO*Y1 + X1*YO 
STL A, @W1 ;save WL 

LD A,-16,A j;A =A >> 16 

MAC *AR2, *AR3,A ;A = (XO*YL + X1*Y0)>>16 + X1*Y1 
STL A, @W2 ;save W2 

STH A, @W3 ;save W3 


Example 6-8. 32-Bit Fractional Multiplication 


PRELIMINARY 


| A fe fe ee ae a A A A A A A A A A A A A A A A 


This routine multiplies two Q31 signed integers 
resulting in a Q30 product. The operands are fetched 
from data memory and the result is written back to data 


memory. 
Data Storage: 
X1,X0 Q31 operand 
Y1,Y0 Q31 operand 
w1,WO Q30 product 


Entry Conditions: 
SXM = 1, OVM = 0 


| A fe fae ee ae A fe A A A A A A A A A A A A A A A A A A A A A A A A 


STM #X0, AR2 ;AR2 = XO addr 
STM #Y1,AR3 ;AR3 = Y1 addr 

LD #0,A ;clear A 

MACSU *AR2+,*AR3-,A  j;A = XO*Y1 

MACSU *AR3+,*AR2,A 7A = XO*Y1 + X1*YO 
LD A,-16,A j;A =A >> 16 

MAC *AR2, *AR3,A jA =A + X1*Y1 


STL A, @wWO 
STH A, @W1 


;save lower product 
;save upper product 
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6.5 Floating-Point Arithmetic 
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In fixed-point arithmetic, the binary point that separates the integer from the 
fractional part of the number is fixed at a certain location. For example, if a 
32-bit number places the binary point after the most significant bit (which is 
also the sign bit), only fractional numbers (numbers with absolute values less 
than 1), canbe represented. The fixed-point system, although simple to imple- 
ment in hardware, imposes limitations in the dynamic range of the represented 
number. You can avoid this difficulty by using floating-point numbers. 


A floating-point number consists of a mantissa, m, multiplied by a base, b, 
raised to an exponent, e, as follows: 


m * be 


To implement floating-point arithmetic on the 54x, operands must be con- 
verted to fixed-point numbers and then back to floating-point numbers. Fixed- 
point values are converted to floating-point values by normalizing the input 
data. 


Floating-point numbers are generally represented by mantissa and expo- 
nent values. To multiply two numbers, add their mantissas, multiply the expo- 
nents, and normalize the resulting mantissa. For floating-point addition, shift 
the mantissa so that the exponents of the two operands match. Left-shift the 
lower-power operand by the difference between the two exponents. Add the 
exponents and normalize the result. 


Figure 6—4 illustrates the IEEE standard format to represent floating-point 
numbers. This format uses sign-magnitude notation for the mantissa, and the 
exponent is biased by 127. In a 32-bit word representing a floating-point num- 
ber, the first bit is the sign bit, represented by s. The next eight bits correspond 
to the exponent, which is expressed in an offset-by-127 format (the actual ex- 
ponent is e—-127). The following 23 bits represent the absolute value of the 
mantissa, with the most significant 1 implied. The binary point is placed after 
this most significant 1. The mantissa, then, has 24 bits. 
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PRELIMINARY Floating-Point Arithmetic 


Figure 6—4. IEEE Floating-Point Format 


8 23 


1 


The values of the numbers represented in the IEEE floating-point format are 
as follows: 


(-1)$ * 2e-127 * (01.f) IfO<e< 255 


Special Cases: 


(-1)§ * 0.0 If e = 0, and f = 0 (zero) 

(-1)§ * 2-126 * (0.f) If e = 0 and f <> 0 (denormalized) 
(—1)S * infinity If e = 255 and f = 0 (infinity) 

NaN (not a number) If e = 255 andf <>0 


Example 6-9 through Example 6-11 illustrate how the 54x performs floating- 
point addition, multiplication, and division. 


Example 6-9. Add Two Floating-Point Numbers 


Ks KKK KK KKK KK KKK KEK KKK KKK KK KKK KK KKK KKK KK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKKKKKKK KKK KK KKK 
*; FLOAT_ADD - add two floating point numbers 

*; Copyright (c) 1993-1994 Texas Instruments Incorporated 

*; NOTE: The ordering of the locals are placed to take advantage of long word 

* 

* 


; loads and stores which require the hi and low words to be at certain addresses. 
; Any future modifications which involve the stack must take this quirk into 


x account 
xy KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KKK KEK KKK KKK KEK KK KKK KKKKK KKK KK 


KKEKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKKKKKKEK 


,Operand 1 (OP1) and Operand (OP2) are each packed into sign, exponent, and the 
;words of mantissa. If either exponent is zero special case processing is initiated. 
;In the general case, the exponents are compared and the mantissa of the lower 
;exponent is renormalized according to the number with the larger exponent. The 
;mMantissas are also converted to a two’s complement format to perform the actual 
;addition. The result of the addition is then renormalized with the corresponding 
;adjustment in the exponent. The resulting mantissa is converted back to its 
;original sign-magnitude format and the result is repacked into the floating point 


; representation. 
KKEKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKKKKE 


es KKEKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKK 


a resource utilization: B accumulator, T-register 
ne status bits affected: TC, C, SXM, OVM, 
a ntry requirements : CPL bit set 


ey KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK 
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Example 6-9. Add Two Floating-Point Numbers (Continued) 


; Floating Point Format - Single Precision 

*) 31) 30 | 29 | 28) 27 | 26| 25 | 24 | 23 | 22 | 21 | 20|19| 18/17 | 16 | 
Oe ap ee |e ee ee ae eo | es ee es | 
(aI ARTE OE IAS NE ERROR TICE TENET SINR ROR ROIEE ITI OE ERS TEI 
*| 15 | 14 | 13 | 12 | 11 | 10 | 9 | 8] 7{ 6 5 4 3 | 2] 11] =0 


Sc a a a a a a Se a a a a a SS Se a a ee SS a a Se a Se ee 


*; Single precision floating point format is a 32 bit format consisting of a 1 bit 
sign field, an 8 bit exponent field, and a 23 bit mantissa field. The fields are 
defined as follows 


*; Sign <S> : O = positive values; 1 = negative value 

*; Exponent <E7-E0 : offset binary format 

*s 00 = special cases (i.e. zero) 

Bae 01 = exponent value + 127 = -126 

* FE = exponent value + 127 = +127 

#5 FF = special cases (not implemented) 

*; Mantissa <M22-M0> : fractional magnitude format with implied 1 

*; 1.M22M21...M1MO 

eS Range : —-1.9999998 e+127 to -1.0000000 e-126 
*; +1.0000000 e-126 to +1.9999998 e+127 
a (where e represents 2 to the power of) 
hades —3.4028236 e+38 to -1.1754944 e-38 
* +1.1754944 e-38 to +3.4028236 e+38 


~ 


a: (where e represents 10 to the power of) 
EE RELEAREREERER EERE EEE EER RE ERR EER ER EER EER RE EERE ER RR BER RBA RR RRR ET RB AER RBA RR BRR BBR 


res_hm -usect "“flt_add”,1 ; result high mantissa 
res_1m -usect "“flt_add”,1 ; result low mantissa 
res_exp -usect “flt_add”,1 ; result exponent 
res_sign -usect “flt_add”,1 ; result sign 
op2_hm -usect "“flt_add”,1 ; OP2 high mantissa 
op2_lim -usect "“flt_add”,1 ; OP2 low mantissa 
op2_se -usect “flt_add”,1 ; OP2 sign and exponent 
opl_se -usect “flt_add”,1 ; OP1l sign and exponent 
op1_hm -usect "“flt_add”,1 ; OPl high mantissa 
op1_im -usect "“flt_add”,1 ; OP1 low mantissa 
opl_msw -usect "“flt_add”,1 ; OP1l packed high word 
opl_lsw -usect “flt_add”,1 7 OP1l packed low word 
op2_msw -usect "“flt_add”,1 7 OP2 packed high word 
op2_lsw -usect "“flt_add”,1 7 OP2 packed low word 
err_no -usect "“flt_add”,1 : 

-mmregs 


KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KK KKK KK 


* Floating point number 12.0 can be represented as 1100 = 1.100 x 23 => sign =0 
* biased exponent = 127+3 = 130 

* 130 = 10000010 

7 Mantissa 10000000000000000000000 

* Thus 12.0 can be represented as 01000001010000000000000000000000= 4140h 

*x 


KKRKKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK 
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Example 6-9. Add Two Floating-Point Numbers (Continued) 
K_OP1_HIGH -set 4140h , floating point number 12.0 
K_OP1_LOW -set 0000h 
K_OP2_HIGH .set 4140h , floating point number 12.0 
K_OP2_LOW .set 0000h 

-mmregs 

sCeExt 
start_flt: 

RSBX C16 

LD #xres_hm,DP ; initialize the page pointer 

LD #K_OP2_HIGH,A ; load floating #2 - 12 

STL A,op2_msw 

LD #K_OP2_LOW,A 

STL A,op2_lsw 

LD #K_OP1_HIGH,A ; load floating #1 - 12 

STL A,opl_msw 

LD #K_OP1_LOW,A 

STL A,opl_lsw 


~ 


oN 


~ 


~ 


r 


+ + F F FF F F HF 
~ 


~ 


oN 


New 


+ + + F F F F F HF F 
~ ~ 
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Test OP1l for special case treatment of zero. 
Split the MSW of OP1l in the accumulator. 


p RRR RRR KKK KR KKK KKK KKK KK KKK KKK KKK KKK KEK KKK KK KKK KKK KKK KKK KEK KKK KK EK KR KK KKK KKK KE KK 


CONVERSION OF FLOATING POINT FORMAT - UNPACK 


Save the exponent on the stack [xxxx xxxx EE 
Add the implied one to the mantissa value. 
Store the mantissa as a signed value 


EB]. 


KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK 


DLD opl_msw,A ; load the OP1 high word 

SFTA A,8 , shift right by 8 

SFTA A,-8 

BC opl_zero, AEQ , If opl is 0, jump to special case 

LD A,B 7 Copy OP1 to acc B 

RSBX SXM , Reset for right shifts used for masking 
SFTL A,1 ; Remove sign bit 

STH A,-8,opl_se ; Store exponent to stack 

SFTL A,8 ;, Remove exponent 

SFTL A,-9 

ADD #080h,16,A ; Add implied 1 to mantissa 

XC 1,BLT ; Negate OP1 mantissa for negative values 
NEG A 

SSBX SXM ; Make sure OP2 is sign-extended 

DST A,opi_hm ; Store mantissa 


Test OP1l for special case treatment of zero. 
Split the MSW of OP1 in the accumulator. 


JO III I ICICI I III I IIIS II ICIOIOI I II IICIOI III III I I ICICI I I I III I A I I I I a 
CONVERSION OF FLOATING POINT FORMAT - UNPACK 


Save the exponent on the stack [xxxx xxxx EE 
Add the implied one to the mantissa value. 
Store the mantissa as a signed value 


EB]. 


p RRR RRR RRR KR KKK KKK KE KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KK EK KR KKK KEK KR KK KKK KEK KEE KK 
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Example 6-9. Add Two Floating-Point Numbers (Continued) 


DLD op2_msw,A ; Load acc with op2 

BC op2_zero, AEQ ; If op2 is 0, jump to special case 

LD A,B 7 Copy OP2 to acc B 

SFTL A,1 ; Remove sign bit 

STH A,-8,op2_se ; Store exponent to stack 

RSBX SXM ; Reset for right shifts used for masking 
SFTL A,8 ; Remove exponent 

SFTL A,-9 

ADD #080h,16,A ; Add implied 1 to mantissa 

XC 1,BLT ; Negate OP2 mantissa for negative values 
NEG A 

SSBX SXM ; Set sign extension mode 

DST A,op2_hm ; Store mantissa 


RK RR KKK KR K KR KKK KK KK KKK KEK KK KK KKK KEK KK KK KKK KR KK KKK KKK KK KK KEK KEK KKK KEKE KK ERK KKK KEK 


EXPONENT COMPARISON 
Compare exponents of OP1 and OP2 by subtracting: exp OP2 - exp OP1 
; Branch to one of three blocks of processing 
Case 1: exp OP1 is less than exp OP2 
Case 2: exp OP1 is equal to exp OP2 


Case 3: exp OP1 is greater than exp OP2 
eC KKK KKK KK KKK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKKKKKKKK KKK KKK KKK 


~ 


~ 


~ 


+ £ F FF F F 
New ~ 


LD opl_se,A ; Load OP1 exponent 
LD op2_se,B ; Load OP2 exponent 
* 
SUB A,B ; Exp OP2 - exp OP1 --> B 
BC opl_gt_op2, BLT ; Process OP1l > OP2 
BC op2_gt_op1,BGT ; Process OP2 > OP2 


* 

He eREKR EERE KEKE EERE EKER ERR ERE EERE ERE ERE EERE EERE ERERERER EKER RERERERE EERE KEKE 
7 exp OP1 = exp OP2 

*; Mantissas of OP1 and OP2 are normalized identically. 

*; Add mantissas: mant OP1l + mant OP2 

*; If result is zero, special case processing must be executed. 


*; Load exponent for possible adjustment during normalization of result 
EE RRERRRER ERE ERR ERR EERE EH RR BERR BR RRB ER BI RH BRR BIRR BRR BR RB RR BR RRB RRR BR RR BR 


a_eq_b 
DLD opl_hm,A ; Load OP1 mantissa 
DADD op2_hm,A ; Add OP2 mantissa 
BC res_zero, AEQ ; If result is zero, process special case 
LD opl_se,B ; Load exponent in preparation for normalizing 


KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK 


normalize THE RESULT 

Take the absolute value of the result. 

Set up to normalize the result. 

The MSB may be in any of bits 24 through 0. 

Left shift by six bits; bit 24 moves to bit 30, etc. 


Normalize resulting mantissa with exponent adjustment. 
fF RRRRA RAKE RAKE ERK EREKE EEK ERA KK EEK EREKE EEK ER ERE EEK EREKE EEK ER ERE EEK EREKE EEK ERE KE 


’ 


~ 


ooNe 


Nese 


+ + F FF FF F F FH 
~ ~ 
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normalize 
STH A,res_sign ; Save signed mantissa on stack 
ABS A ; Create magnitude value of mantissa 
SFTL A,6 ; Pre-normalize adjustment of mantissa 
EXP A , Get amount to adjust exp for normalization 
NOP 
NORM A ; Normalize the result 
ST T, res_exp ; Store exp adjustment value 
ADD #1,B ; Increment exp to account for implied carry 
SUB res_exp,B ; Adjust exponent to account for normalization 


p RRR RRR RRR KR KKK KEK KKK KK KKK KKK KKK KKK KEK KKK KK KKK KKK KKK KEK KKK KR KKK KEK KE KK KKK KEK KEK K 


* 

* 

ay POST-NORMALIZATION ADJUSTMENT AND STORAGE 
*; Test result for underflow and overflow. 
* 
* 
* 


; Right shift mantissa by 7 bits. 
7 Mask implied 1 
; Store mantissa on stack. 
xs KKK KK KKK KEK KKK KKK KEK KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KKK KEK KKK KKK KEK KK KKK KKKKK KKK KK 


* 


normalized 
STL B, res_exp ; Save result exponent on stack 
BC underflow, BLEQ 7 process underflow if occurs 
SUB #O0FFh,B ; adjust to check for overflow 
BC overflow, BGEQ 7 process overflow if occurs 
SFTL A,-7 , Shift right to place mantissa for splitting 
STL A,res_lm ; Store low mantissa 
AND #07F00h,8,A ; Eliminate implied one 
STH A, res_hm ; Save result mantissa on stack** 


SRRERERERE LER RER ERE EER ER ER ER ER ER ER ERE EER ER ER ER ER ER ER ER ERE BER ER ER ERE REE ER ER ERE 
xy KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK 


*; CONVERSION OF FLOATING POINT FORMAT - PACK 

*; Load sign. 

*; Pack exponent. 

*; Pack mantissa. 

Ke REKERERERERERERER EKER ERE RERERERER EKER RARER ERE EERE KE EKER ERK EER ERERERERERERKERERES 

* 
LD res_sign,9,A 7 0000 000S 0000 0000 0000 0000 0000 0000 
AND #100h,16,A 
ADD res_exp,16,A ; 0000 000S REEE EEEE 0000 0000 0000 0000 
SFTL A,7 ; SEEE EEEE E000 0000 0000 0000 0000 0000 
DADD res_hm,A ; SEEE EEEE EMMM MMMM MMMM MMMM MMMM MMMM 


PRR RRR KR RK KK KK KEK KEK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KEK KK KK KEK KEK KKK KEK KE KK 


* 
* 
*; CONTEXT RESTORE 
* 
* 


7 Pop local floating point variables. 


; Restore contents of B accumulator, T Register 
Ks KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKK KK KKEKKKKKKKKK KK KKK KK 


* 


return_value 
NOP 


RET 


PRELIMINARY Arithmetic and Logical Operations 6-29 


Floating-Point Arithmetic PRELIMINARY 


Example 6-9. Add Two Floating-Point Numbers (Continued) 


KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK 


exp OP1l > exp OP2 


ke 

: 
*; 

*; Test if the difference of the exponents is larger than 24 (precision of the mantissa) 
*; Return OP1 as the result if OP2 is too small. 

*; Mantissa of OP2 must be right shifted to match normalization of OP1 

* 


; Add mantissas: mant OP1 + mant op2 
RIERA ASA BER ODS Wate PA A Dee Bake Tee TA Ge OCEANA De eR Da ee I Te A RP Te Ae Rage Mecmie SM M Ran Ce A ee ene Ba ae Mae gee 


* 


opl_gt_op2 
ABS B ; If exp OP1l >= exp OP2 + 24 then return OP1 
SUB #24,B 
BC return_op1, BGEQ 
ADD #23,B ; Restore exponent difference value 
STL B, res_sign ; Store exponent difference to be used as RPC 
DLD op2_hm,A ; Load OP2 mantissa 
RPT res_sign ; Normalize OP2 to match OP1 
SFTA A,-1 
BD normalize ; Delayed branch to normalize result 
LD opl_se,B ; Load exponent value to prep for normalization 
DADD opl_hm,A ; Add OP1 to OP2 


* 

EE RREKRRERRREER EER ARERR RE REE ERR ER REE REE ER REE RRR ERR ERE EKER KERR EKER ERE RE ERS ER 
*; OP1 < OP2 

*; Test if the difference of the exponents is larger than 24 (precision of the mantissa). 
*; Return OP2 as the result if OP1l is too small. 

*; Mantissa of OP1l must be right shifted to match normalization of OP2. 


*; Add mantissas: mant OP1l + mant OP2 
HP RREKRRERRRERA EER ER ERR ER ER ERR RE RE EER EERE ERR EER ER ERR EER ER EERE ER ER ERR EER ER EERE 


op2_gt_opl 
SU B #24,B ; If exp OP2 >= exp OP1 + 24 then return OP2 
BC return_op2, BGEQ 
ADD #23,B ; Restore exponent difference value 
STL B,res_sign ; Store exponent difference to be used as RPC 
DLD opl_hm,A ; Load OP1 mantissa 
RPT res_sign ; Normalize OP1l to match OP2 
SFTA A,-1 BD normalize ; Delayed branch to normalize result 
LD op2_se,B ; Load exponent value to prep for normalization 
DADD op2_hm,A ; Add OP2 to OP1 
KR eeKEKK EERE EKE EERE EKER EKER EKER EKER ERE EERE ERE EERE EERE EKER ERE EERE EERE EER ER ERE 
*; OP1 << OP2 or OP1 = 0 


Ky RK RK KKK KEK KKK KKK KR KKK KEK KR KK KKK KKK KKK KKK KK KKK KKK KKK KEK KKK KKK KEK KKK KEK KEK KKK 
* 


return_op2 


opl_zero 
BD return_value 
DLD op2_msw,A ; Put OP2 as result into A 
NOP 


* 

Ee RREKERERER EERE ER REE RR EER ERE RRR ER ERE REE RRR EERE ER RR ERR ER ER KERR ER RR EERE BK RR KER 
*e OP1 << OP2 or OP1 = 0 

FREER ERRRERR EER ER ERE EER ER EKER ER ERE REE ER ER EKER ERE ERE EER ER RRR ER ER ERE EER ER EER 
* 
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Example 6-9. Add Two Floating-Point Numbers (Continued) 


op2_zero 
return_opl 
DLD opl_hm,A ; Load signed high mantissa of OP1 
BC opl_pos,AGT ; If mantissa is negative : 
NEG A ; Negate it to make it a positive value 
ADDM #100h,opl_se ; Place the sign value back into opl_se 
opl_pos 
SUB #80h,16,A ; Eliminate implied one from mantissa 
LD opl_se,16,B , Put OP1 back together in acc A as a result 
BD return_value 
SFTL B,7 
ADD B,A 
xy KKK KK KKK KEK KKK KKK KEK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KEK KKK KKK KEK KKK KKKKKKK KKK KK 
a overflow PROCESSING 
* 


; Push errno onto stack. 


*; Load accumulator with return value. 
xs KEK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KKK KEK KKK KKKKKKKKKEKK 


* 


overflow 
ST #2,err_no ; Load error no 
LD res_sign,16,A ; Pack sign of result 
AND #8000,16,A ; Mask to get sign 
OR #OFFFFh,A ; Result low mantissa = OFFFFh 
BD return_value ; Branch delayed 
ADD #O07F7Fh,16,A ; Result exponent = OFEh 


, Result high mant = O7Fh 
KKK KK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KKK KEK KK KKK KEK KKK KKK KK KKK KKKKKKKKK 
me underflow PROCESSING 
*; Push errno onto stack. 


*; Load accumulator with return value. 
xy KEK KK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KEK KKK KKK KE KKKKKKKKKKK KKK KK 


* 


underflow 
ST #1,err_no ; Load error no 
RET 
res_zero 
BD return_value ; Branch delayed 
SUB A,A ; For underflow result = 0 
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Example 6—10. Multiply Two Floating-Point Numbers 


Kp RRR KKK KK KKK KKK KR KK KKK KKK KK KK KKK KKK KK KK KKK KER KR KK KKK KEK KR KKK KKK KE KKK KEK KKK KEK KKK 


*; Float_MUL - multiply two floating point numbers 
*; Copyright (c) 1993-1994 Texas Instruments Incorporated 


RRR KKK KK KKK KKK KKK KR KKK KKK KK KK KKK KKK KK KK KKK KER KK RK KKK KKK KKK KKK KE KKK RK KKK KKK KKK 
KR RRR KK KKK KKK KKK KKK KR KKK KKK KE KK KKK KEK KK KK KKK KEK KK RK KKK KEK KR KKK KKK KEK KKK KERR KEK KKK KER 


;This routine multiplies two floating point numbers. OP1 and OP2 are each unpacked 
;into sign, exponent, and two words of mantissa. If either exponent is zero 

7; special case processing is initiated. The exponents are summed. If the result is 
;less than zero underflow has occurred. If the result is zero, underflow may have 
;occurred. If the result is equal to 254 overflow may have occurred. If the result 
jis greater than 254 overflow has occurred. Underflow processing returns a value 
7;of zero. Overflow processing returns the largest magnitude value along with the 
yappropriate sign. If no special cases are detected, a 24x24-bit multiply is 
7executed. The result of the exclusive OR of the sign bits, the sum of the 


7exponents and the ;24 bit truncated mantissa are packed and returned 
EER RAKK EEK ERE KE EEK ER ERE KEKE RE KE EKER EKER EEK ERE KE REKEREKE KEKE RE KK EKER ERE RE ER ERE 


Bee resource utilization: B accumulator, T-register 

* status bits affected: TC, C, SXM, OVM, C16 

ay entry requirements : CPL bit set 

Ee RKEKE EKA ERE RERER ERE KERR EERE RE RERERE EERE ERE EERE EKER EKER ERE RERER ERE RER ER ERE 

; Floating Point Format - Single Precision 

KS SS SS SS SS Se a a SS a SS a SS SS a SS SS a Sa SS i Se Se 

*| 31 | 30 | 29 | 28 | 27 | 26 | 25 | 24 | 23 | 22 | 21 | 20 | 19 | 18 | 17 | 16 

«| ----]----|----|----]----] ----] --=-]----]----] ----] ----|----|----] ----]----|---- | 

*| s | E7 | E6 | ES | £4 | E3 | E2 | El | EO | M22| M21] m20| M19| M18] M17| M16 

a a a a a a a a a a a a a a a = 

a a a a a a a a a a Sa a SS a SS a a a a a a Se a Ss = = 

*| 15 | 14 | 13 | 12 | 11 | 10 | 9 | 8] 7{ 6 5 4 3 2 1 | 0 

*| m15| m14| M13| M12| M11| M1o| M9 | ms | M7 | Mo | M5 | m4 | m3 | m2 | m1 | MO 

0 a a a a a a a a a eS ee eS 
; Single precision floating point format is a 32 bit format consisting of a * 


; 1 bit sign field, an 8 bit exponent field, and a 23 bit mantissa field. The * 
; fields are defined as follows. ba 


* 

* 

* 

_ Sign <S> : O = positive values; 1 = negative values 
Bae Exponent <E7-E0> : offset binary format 

*; 00 = special cases (i.e. zero) 

in 01 = exponent value + 127 = -126 

a FE = exponent value + 127 = +127 

*s FF = special cases (not implemented) 

*; Mantissa <M22-M0> : fractional magnitude format with implied 1 

*; 1.M22M21...M1MO 

a Range : —-1.9999998 e+127 to -1.0000000 e-126 

*; +1.0000000 e-126 to +1.9999998 e+ 

es (where e represents 2 to the power of) 
*; —3.4028236 e+38 to -1.1754944 e- 

*; +1.1754944 e-38 to +3.4028236 e+38 

es (where e represents 10 to the power of) 
* 


PRR RR RK RRR K KKK KR RK KKK KKK RK KKK KKK KR KK KKK KEK KE KK KKK KEK KR KK KKK KEK KER KKK KK ERK KKK KKK KEK 
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res_hm -usect "“flt_add”,1 ;result high mantissa 
res_1m -usect "“flt_add”,1 ;result low mantissa 
res_exp -usect "“flt_add”,1 ;result exponent 
res_sign -usect "“flt_add”,1 ; result sign 

op2_hm -usect "flt_add”,1 ; OP2 high mantissa 
op2_l1m -usect "“flt_add”,1 ; OP2 low mantissa 
op2_se -usect "“flt_add”,1 , OP2 sign and exponent 
opl_se -usect "“flt_add”,1 ; OP1 sign and exponent 
op1_hm -usect "“flt_add”,1 ; OP1 high mantissa 
op1_lm -usect "“flt_add”,1 ; OP1 low mantissa 
op1l_msw -usect “flt_add”,1 ; OP1 packed high word 
op1_lsw -usect "“flt_add”,1 ; OP1 packed low word 
op2_msw -usect “flt_add”,1 7 OP2 packed high word 
op2_lsw -usect "“flt_add”,1 7 OP2 packed low word 
err_no -usect "“flt_add”,1 ; 


KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


* Floating point number 12.0 can be represented as 1100 = 1.100 x 23 => sign =0 
x biased exponent = 127+3 = 130 

* 130 = 10000010 

a Mantissa 10000000000000000000000 
* Thus 12.0 can be represented as 01000001010000000000000000000000= 4140h 

*x 


KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KR KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KK KKK 


K_OP1_HIGH -set 4140h , floating point number 12.0 
K_OP1_LOW -set 0000h 
K_OP2_HIGH -set 4140h ; floating point number 12.0 
K_OP2_LOW .set 0000h 
-mmregs 
sbext 
start_flt: 
RSBX C16 ; Insure long adds for later 
LD #res_hm,DP ; initialize the page pointer 
LD K_OP2_HIGH,A ; load floating #2 - 12 
STL A, op2_msw 
LD #K_OP2_LOW,A 
STL A,op2_lsw 
LD K_OP1_HIGH,A ; load floating #1 - 12 
STL A, opl_msw 
LD #K_OP1_LOW,A 
STL A,opl_lsw 


KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK 


CONVERSION OF FLOATING POINT FORMAT - UNPACK 
; Test OP1 for special case treatment of zero. 
Split the MSW of A in the accumulator. 
Save the sign and exponent on the stack [xxxx xxxS EEEE EEEE]. 
Add the implied one to the mantissa value 
Store entire mantissa with a long word store 
pREKREREREREKRERERERERERERER ER ERE RERERERERERERERERERERERERER ER ERE REE ER ERERER ERE 
DLD opl_msw,A 7 OP1 
SFTA A,8 
SFTA A,-8 
BC op_zero, AEQ ; if opl is 0, jump to special case 


‘eos 


~ 


~NeoNe 


+ + + F FF F HF FH 
~e 
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Example 6—10. Multiply Two Floating-Point Numbers (Continued) 


* 
* 
* 
* 
* 
* 
* 
* 


STH A,-7,opl_se ; store sign AND exponent to stack 

STL A,op1l_lm ; store low mantissa 

AND #07Fh,16,A ; mask off sign & exp to get high mantissa 
ADD #080h,16,A ; ADD implied 1 to mantissa 

STH A,opli_hm ; store mantissa to stack 


KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


’ 


; CONVERSION OF FLOATING POINT FORMAT —- UNPACK 
; Test OP2 for special case treatment of zero. 
; Split the MSW of A in the accumulator. 


: Save the sign and exponent on the stack [xxxx xxxS EEEE EEEE]. 
; Add the implied one to the mantissa value. 
7 Store entire mantissa with a long word store 
PR RERARERARERE EERE EERE RERR EERE EERE ERE REREREREAEA ER ER EAEREAE ERE ARERR ER EER KEE ERE 
DLD op2_msw,A ; load acc a with OP2 
BC op_zero,AEQ ; if OP2 is 0, jump to special case 
STH A,-7,0p2_se ; store sign and exponent to stack 
STL A,op2_l1m ; store low mantissa 
AND #07Fh,16,A ; mask off sign & exp to get high mantissa 
ADD #080h,16,A ; add implied 1 to mantissa 
STH A,op2_hm ; store mantissa to stack 


KH KKK KKK KK KKK KKK KR KKK KK KEK KK KK KKK KE KK KKK KKK KEK KKK KK RR KEK KKK KK KKK ERK KEK KEK KEK KKK 


ade SIGN EVALUATION 


*; Exclusive OR sign bits of OP1l and OP2 to determine sign of result. 
EE RRR EERE ER RE RR EERE ER RR EER ER RR BERR ER RRR ERR RRR ERR RR RR ER RR RRR RE RRR RRR 


LD opl_se,A ; load sign and exp of opl to acc 
XOR op2_se,A ; xor with op2 to get sign of result 
AND #00100h,A ; mask to get sign 

STi A,res_sign ; save sign of result to stack 


~ 


is a carry in the result that increments th xponent to 1. 
Cases when result exp = OFEh may result in overflow if there 


is a carry in the result that increments th xponent to OFFh. 
FRREKRREER ER ERE ER ER EKER ER RR ERRR ERR ERE EER RR ERERRR RRR ERE RRR RRR ERR RE EE ER ERR BRE RK 


~ 


~ 


EE RRERERER ERE ERE ER RR ERR ER ERE ERE ER ERE REE ER RR ER ER ER RR ER ERR RR ER ER ER RRR ERR RE 
at EXPONENT SUMMATION 

*; Sum the exponents of OP1 and OP2 to determine the result exponent. Since 
*; the exponents are biased (excess 127) the summation must be decremented 
*; by the bias value to avoid double biasing the result 

*; Branch to one of three blocks of processing 

a Case 1: exp OP1 + exp OP2 results in underflow (exp < 0) 

x Case 2: exp OP1 + exp OP2 results in overflow (exp >= OFFh) 

<7 Case 3: exp OP1 + exp OP2 results are in range (exp >= 0 & exp < OFFh) 
a NOTE: Cases when result exp = 0 may result in underflow unless there 
* 

*x 

* 

* 


LD opl_se,A ; Load OP1 sign and exponent 

AND #O0O0FFh,A ; Mask OP1 exponent 

LD op2_se,B ; Load OP2 sign and exponent 

AND #O0FFh,B ; Mask OP2 exponent 

SUB #07Fh,B ; Subtract offset (avoid double bias) 
ADD B,A ; Add OP1 exponent 

STL A, res_exp ; Save result exponent on stack 

BC underflow, ALT ; branch to underflow handler if exp < 0 
SUB #OFFh,A ; test for overflow 

BC overflow, AGT ; branch to overflow is exp > 127 


Hg RRR KK KKK KKK KKK KR KR KKK KKK KK KK KKK KEK KKK KK KK KK KK KKK KKK KEK KR KKK KEK KKK KK EK KEK KKK KKK 
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*e MULTIPLICATION 

*; Multiplication is implemented by parts. Mantissa for OPl is three bytes 

*; identified as Q, R, and S$ 

*; (Q represents OP1l high mantissa and R and S represent the two bytes of OP1 low 
*; mantissa). Mantissa for 

*; OP2 is also 3 bytes identified as X, Y, and Z (X represents OP2 high mant and 
*; Y and Z represent the two bytes 

*; of OP2 low mantissa). Then 

Ks oO Oo R § (mantissa of OP1) 

ae x O X Y Z (mantissa of OP2) 

aa SSS 

*; RS*YZ <-- save only upper 16 bits of result 

*; RS*0X 

*; OQ*YZ 

*; 0Q*0X <-- upper 16 bits are always zero 

x =======—==== 

* result <-- result is always in the internal 32 bits 
* 


; (which ends up in the accumulator) of the possible 64 bit product 
x KEK KK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KEK KKK KK KKK KKK KK KKK KEK KKK KKK KKK KKK KKKKKK KKK KK 


LD op1_1m,T ; load low mant of opl to T register 
MPYU op2_l1m,A A ORS: * YZ 
MPYU op2_hm,B ;, RS * OX 
ADD A,-16,B ; B = (RS * YZ) + (RS * OX) 
LD opl_hm,T ; load high mant of opl to T register 
MPYU op2_l1m,A , A= 00 * YZ 
ADD B,A ;, A = (RS * YZ) + (RS * OX) + (00 * YZ) 
MPYU op2_hm,B , B= 00 * OX 
Sti B,res_hm ; get lower word of 0Q * OX 
ADD res_hm,16,A ; A = final result 
RRR RRR RR RR RE RR RRR RR RR RRR RRR RR RRR RR RR RR IR BR RT RR RT RRR RRR TER TR RR TE TER RB BR Re 
*; POST-NORMALIZATION ADJUSTMENT AND STORAGE 
*; Set up to adjust the normalized result. 
a The MSB may be in bit 31. Test this case and increment the exponent 
ay and right shift mantissa 1 bit so result is in bits 30 through 7 
*; Right shift mantissa by 7 bits. 
xs Store low mantissa on stack. 
*; Mask implied 1 and store high mantissa on stack. 
* 


; Test result for underflow and overflow. 
KKK KK KKK KK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KK KKK KKK KKK KEKKKKKK KKK KKK 


ADD #040h,A ; Add rounding bit 

SFTA A,8 ; Sign extend result to check if MSB is in 31 
SFTA A,-8 

RSBX SXM ; turn off sign extension for normalization 
LD res_exp,B ; load exponent of result 

BC normalized,AGEQ ; check if MSB is in 31 

SFTL A,-1 ; Shift result so result is in bits 30:7 

ADD #1,B ; increment exponent 

STL B,res_exp ; save updated exponent normalized 

BC underflow, BLEQ ; check for underflow 

SUB #O0FFh,B ; adjust to check for overflow 

BC overflow, BGEQ ; check for overflow 

SFTL A,-7 , shift to get 23 msb bits of mantissa result 
STL A,res_lm ; store low mantissa result 
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AND #07F00h,8,A ; remove implied one 


STH A, res_hm ; store the mantissa result 


PRR RR RK RRR KK KKK EK KKK KK KKK RK KK RK KKK KEK KR KKK KR KK KKK KEK KEK KR KKK KE KK KKK KKK KEK KKK KK KK 


* 

at CONVERSION OF FLOATING POINT FORMAT - PACK 
*; Load sign. 

*; Pack exponent. 

*; Pack mantissa. 

* 


p RRR RRR RRR KK KKK RK KKK KKK RK KKK KKK KKK KR RK KKK KKK KK KKK KEK KR KKK KK KEK KR KK KKK KEK KKK KR KKK 


LD res_sign,16,A 7 0000 000S 0000 0000 0000 0000 0000 0000 

ADD res_exp,16,A ; 0000 000S EEEE EEEE 0000 0000 0000 0000 

SFTL A,7 ; SEEE EEEE E000 0000 0000 0000 0000 0000 

DADD xres_hm,A ; SEEE EEEE EMMM MMMM MMMM MMMM MMMM MMMM 
EE RRERR EER REEKR EER ER EER EER ER ERR ER ER ERR EER RR RE EERE RR ER ERR REE ERE RRR RR ER ER ER ER REE RH 
*; CONTEXT RESTORE 


Ry RR RK KKK KR RK KK RK KR KR KKK KKK KR KK KKK KKK KKK KK KK KKK KK KKK KKK KKK KKK KEK KKK KEK KEK KKK KKK 


return_value 
op_zero 
nop 
nop 
ret 
KER EKK AKA KEE KERAEKEREKE KEKE REKE EEK ER ERE EEK ER ERR ERK EA EKER ERKEK ERE EEK EA ERE ERKER ERE 
*; overflow PROCESSING 
*; Push errno onto stack. 


*; Load accumulator with return value. 
EERE RAKEREKERERER EKER EERE EERE RERER ERE REREREREAEAREE EKEERAEEA ERE AERAE EEK EERAEA ERE 


overflow 

ST #2,err_no ; Load error no 

LD res_sign,16,B ; Load sign of result 

LD #OFFFFh,A ; Result low mantissa = OFFFFh 

OR B,7,A ; Add sign bit 

BD return_value ; Branch delayed 

ADD #O7F7Fh,16,A ; Result exponent = OFEh 

; Result high mant = O7Fh 

Kee RAKK AKER ARE EEK ER EKER ER ERE EEK ER ERE RE ER ERR AERKE A EREAERK EAR ERE ERK AE EKRERK ER ERE 
*; UNDERFLOW PROCESSING 


*; Push errno onto stack. 


*; Load accumulator with return value. 
Eee REKK EKER ERE EKER ERE EERE EKER EKER ERE EERE EREAEAR EERE ER AEA EKEAREARERREARARA ERE 


underflow 
ST #1,err_no ; Load error no 
BD return_value ; Branch delayed 
SUB A,A ; For underflow result = 0 
NOP 
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xe KKEKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KEK 


*; FLOAT _DIV - divide two floating point numbers 

*; Copyright (c) 1993-1994 Texas Instruments Incorporated 

FRREKRERERER ERE EERER ERE EERE EER ER ER ER ER ER ER ER ERE RER ERE RE RERE BER ER ER ERE RRR ER ER ER 

; Implementation: OP1 and OP2 are each unpacked into sign, exponent, and two words 
,;of mantissa. If either exponent is zero special case processing is initiated. 
;The difference of th xponents are taken. IF the result is less than zero underflow 
;has occurred. If the result is zero, underflow may have occurred. If the result 
;is equal to 254 overflow may have occurred. If the result is greater than 254 
;overflow has occurred. 

; Underflow processing returns a value of zero. Overflow processing returns the 
;largest magnitude value along with the appropriate sign. If no special cases are 
;detected, a 24x24-bit divide is ;executed. The result of the exclusive OR of the 
;Sign bits, the difference of the exponents and the 24 bit truncated mantissa are 


;packed and returned. 
ae KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKK KK KKEKKKKKKKAKKKK KKK KK 


ey KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK 


a resource utilization: B accumulator , T register 
ca status bits affected: TC, C, SXM, OVM, C16 
Ais ntry requirements : CPL bit set 


ey KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK 


; Floating Point Format - Single Precision 


*; Single precision floating point format is a 32 bit format consisting of a 1 
bit sign field, an 8 bit exponent * 
*; field, and a 23 bit mantissa field. The fields are defined as follows 


H Sign <S> : O = positive values; 1 = negative values 
7 Exponent <E7-E0> : offset binary format 

; 00 = special cases (i.e. zero) 

H 01 = exponent value + 127 = -126 

; FE = exponent value + 127 = +127 

; FF = special cases (not implemented) 

; Mantissa <M22-M0> : fractional magnitude format with implied 1 

; 1.M22M21...M1M0 

; Range : -1.9999998 e+127 to -1.0000000 e-126 


+1.0000000 e-126 to +1.9999998 e+127 
(where e represents 2 to the power of) 
-3.4028236 e+38 to -1.1754944 e-38 
+1.1754944 e-38 to +3.4028236 e+ 


(where e represents 10 to the power of) 
,REKERER ARERR REE EER ER ERR EER ERE RE ER ER ERR ER ER ERE RER ER ER RE ERE RER ER ER ER ERR RER ER ERE 


Ne Ne Ne Ne 


+ + + + F F FF F F F FF F F 
~e ~ 
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PRELIMINARY 


res_hm -usect "Flt diy” , 1 
res_1m -usect "Flt diy", 1 
res_exp -usect "Fit dav, 1. 
res_sign -usect "flt_div”,1 
op2_hm -usect MELE aay 5. 
op2_lim -usect Plt oy” 5. 
op2_se -usect TEI dav, 1 
opl_se -usect "hit day, 1 
op1_hm -usect side iy a > Ne, ak 
op1_im -usect PLE ay 5 1. 
opl_msw -usect Plt aia” 71. 
opl_lsw -usect "tlt diy", 1 
op2_msw -usect sid id eam > Wl adore 
op2_lsw -usect Pe aay” 5 
err_no -usect "flit div” , 1 
-mmregs 
* 
* 
K_divisor_high -set 4140h 
K_divisor_low .set 0000h 
K_dividend_high .set 4140h 
K_dividend_low .set 0000h 
-sect "vectors” 
B float_div 
NOP 
NOP 
text 
float_div: 
LD #res_hm,DP ; initialize the page pointer 
LD #K_divisor_high,A ; load floating #2 - 12 
STL A,op2_msw 
LD K_divisor_low,A 
STL A,op2_lsw 
LD #K_ dividend_high,A ; load floating #1 - 12 
STL A,opl_msw 
LD K_dividend_low,A 
STL A,opl_lsw 
KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KKK KKKKK KK KKK KKK 
RSBX C16 ; Insure long adds for later 


*x 
Ee eRAKK AKER ERE EKER EKA EERE EERE REKER ERE EERE ERE EERE ERE EKER ERE EEK ERERE EER ER ERE 
ae CONVERSION OF FLOATING POINT FORMAT - UNPACK 
*; Test OP1 for special case treatment of zero. 
*; Split the MSW of A in the accumulator. 
*; Save the sign and exponent on the stack [xxxx xxxS EREE EEEB]. 
+7 Add the implied one to the mantissa value. 
*; Store entire mantissa with a long word store 
Ke eRAEKKEAKERAEKK ERK ER ERE EKER AK EK EEKEREKRE EEK ER ERE EEK ERE KE EEK ER EK EEK EREKE EER ER EK SE 
DLD opl_msw,A ; load acc a with OP1 
SFTA A,8 
SFTA A,-8 
BC opl_zero, AEQ ; if opl is 0, jump to special case 
STH A,-7,opl_se ; store sign and exponent to stack 
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STL A,opl_lm ; store low mantissa 

AND #07Fh,16,A ; mask off sign & exp to get high mantissa 
ADD #080h,16,A ; ADD implied 1 to mantissa 

STH A,opl_hm ; store mantissa to stack 


* 
xs KKK KK KK KKK KKK KKK KEK KKK KK KKK KK KKK KKK KEK KKK KK KKK KEK KKK KKK KEK KKK KKK KKK KKKKKKKKK KKK KK 
x CONVERSION OF FLOATING POINT FORMAT —- UNPACK 
*; Test OP1 for special case treatment of zero. 
*; Split the MSW of A in the accumulator. 
*; Save the sign and exponent on the stack [xxxx xxxS EEEE EEEE]. 
*; Add the implied one to the mantissa value. 
*; Store entire mantissa with a long word store 
a4 KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KKK KK KKK KEK KKK KKK KK KKK KKK KK KKK KEK KKK KKK KKKKKKKKKK 
DLD op2_msw,A ; load acc a with OP2 
BC op2_zero,AEQ ; if OP2 is 0, divide by zero 
STH A,-7,o0p2_se ; store sign and exponent to stack 
STi A,op2_l1m ; store low mantissa 
AND #07Fh,16,A ; mask off sign & exp to get high mantissa 
ADD #080h,16,A ; ADD implied 1 to mantissa 
STH A,op2_hm ; store mantissa to stack 


* 
wy KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK 


*; SIGN EVALUATION 


*; Exclusive OR sign bits of OP1 and OP2 to determine sign of result. 
ws KKEKKKKKKKKKKKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKEKK 


at LD opl_se,A ; load sign and exp of opl to acc 
XOR op2_se,A , xor with op2 to get sign of result 
AND #00100h,A ; mask to get sign 
STL A,res_sign ; save sign of result to stack 


KKK KK KKK KEK KKK KKK KEK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KEK KKK KKK KEK KK KKK KKK KK KKK KK 
EXPONENT SUMMATION 

Find difference between operand exponents to determine the result exponent. 
Since the subtraction process removes the bias it must be re-added in. 


r 


~ 


~ 


Branch to one of three blocks of processing 
Case 1: exp OP1 + exp OP2 results in underflow (exp < 0) 
Case 2: exp OP1 + exp OP2 results in overflow (exp >= OFFh) 
Case 3: exp OP1 + exp OP2 results are in range (exp >= 0 & exp < OFFh) 


oN 


~ 


~ 


H NOTE: Cases when result exp = 0 may result in underflow unless there * 
is a carry in the result that increments the exponent to l. x 
Cases when result exp = OFEh may result in overflow if there is a carry * 


in the result that increments the exponent to OFFh. 
ERR ERE RARER ARERR EER ER EKER ERE RE KER ER ER ER RR ER ER ERE ER ER ER ER ER ER ERE RER ER ERR RER ER 


+ + + FF F F FF FF F F F HF F 
~ 


LD opl_se,A ; Load OP1 sign and exponent 
AND #OFFh,A ; Mask OP1 exponent 

* 
LD op2_se,B ; Load OP2 sign and exponent 
AND #0OFFh,B ; Mask OP2 exponent 
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ADD #O07Fh,A ; Add offset (difference eliminates offset) 
SUB B,A ; Take difference between exponents 
STL A, res_exp ; Save result exponent on stack 
* 
BC underflow, ALT ; branch to underflow handler if exp < 0 
SUB #OFFh,A ; test for overflow 
BC overflow, AGT 7 branch to overflow is exp > 127 
* 
Eee EEA ERK EERE EERE EKER ERE EERE ERE EER ER ERE EKER ERE EKER ERE EERE RERE EER ER ERE 
*; DIVISION 
*; Division is implemented by parts. The mantissas for both OPl and OP2 are left shifted 
* in the 32 bit field to reduce the effect of secondary and tertiary contributions to 
* the final result. The left shifted results are identified as OP1’HI, OP1’LO, OP2’HI, 
* and OP2’LO where OP1’HI and OP2’HI have the xx most significant bits of the mantissas 
* and OP1’LO and OP2’LO contain the remaining bits * of each mantissa. Let QHI and QLO 
* represent the two portions of the resultant mantissa. Then 
' d , , 
ms OHI + OLO = oe + OEE _ OPI HI aid OPI LO, 1 
OP2'HI + OP2'LO OP2'HI (1 + 2) 
OP2'HI. 
*; Now let X = OP2’LO/OP2’HI 
* Then by Taylor’s Series Expansion 
1 ee: 
7 = HX ORK BP ee dian 
(1 + x) 
on Since OP2’HI contains the first xx significant bits of the OP2 mantissa, * 
X = OP2’LO/OP2’HI < 2-yy*; Therefore the X2 term and all subsequent terms are less 
than the least significant 
= bit of the 24-bit result and can be dropped. The result then becomes 
' Uy , 
. Gar + one = S22 BE TORE Ee, 1 eign) 
OP2'HI + OP2'LO OP2'HI 
OP2'LO 
= (QHI + QLO) * |1-—<— 
Q Q OP2'HI 
=. where Q’HI and Q’LO represent the first approximation of the result. Also since 
Q’LO and OP2’LO/OP2’HI are less significant the 24th bit of the result, this 
product term can be dropped so 
' ' ' ' 
i OHI + OLO = GET ani ei ae _ OPI HI a OPI LO, 1 : 
OP2'HI + OP2'LO OP2'HI (1 + orzo) 
that OP2'HI 
EPR RAKKRAEKE KEE KE EKER EK KEKE REKE EEK ER EKER ERE KEKE EKER ERE KEKE REKRE EEKER ERE EEK ERE KE HE 
DLD opl_hm,A ; Load dividend mantissa 
SFTL A,6 ; Shift dividend in preparation for division 
* 
DLD op2_hm,B ; Load divisor mantissa 
SFTL Bat ; Shift divisor in preparation for division 
DST B,op2_hm ; Save off divisor 
* 
RPT #14 ; QHI = OP1’HI/OP2’HI 
SUBC op2_hm,A 
STL A, res_hm ; Save QHI 
* 
SUBS res_hm,A ; Clear QHI from ACC 
RPT #10 7; Q'LO = OP1’LO / OP2'HT 


SUBC op2_hm,A 
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Example 6—11. Divide a Floating-Point Number by Another (Continued) 


STL A,5,res_lm ; Save Q’LO* 
LD res_hm,T , T = Q’HI 
MPYU op2_l1m,A ; Store Q’HI * OP2’LO in acc A 
SFTL A,=1 ee 
RPT #11 ; Calculate Q’HI * OP2’LO / OP2’HI 
SUBC op2_hm,A ; (correction factor) 
SFTL A,4 ; Left shift to bring it to proper range 
AND #OFFFFh,A ; Mask off correction factor 
* 
NEGA ; Subtract correction factor 
ADDS res_lm,A ; Add Q’LO 
ADD res_hm,16,A ; Add Q’HI 


* 


ey KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK 


a POST-NORMALIZATION ADJUSTMENT AND STORAGE 
*; Set up to adjust the normalized result. The MSB may be in bit 31. Test this 
case and increment the exponent and right shift mantissa 1 bit so result is in 
bits 30 through 7. Right shift mantissa by 7 bits. Store low mantissa on stack. 
Mask implied 1 and store high mantissa on stack. Test result for underflow and 


overflow. 
Ks KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KKK KKK KKK KKK KKK KKK KK KKEKKKKKKKKK KK KKK KK 


* 


LD res_exp,B ; Load result exponent 

EXP A ; Get amount to adjust exp for normalizationNOP 
NORM A ; Normalize the result 

ST T,res_exp ; Store the exponent adjustment value 

SUB res_exp,B ; Adjust exponent (add either zero or one) 
SFTL A,-1 ; Pre-scale adjustment for rounding 

ADD #1,B ; Adjust exponent 

ADD #020h,A ; Add rounding bit 

EXP A ; Normalize after rounding NOP 

NORM A Hi 

ST T,res_exp ; Adjust exponent for normalization 

SUB res_exp,B ; 

STL B, res_exp ; Save exponent 

BC underflow, BLEQ ; process underflow if occurs 

SUB #OFFh,B ; adjust to check for overflow 

BC overflow, BGEQ 7 process overflow if occurs 

SFTL A,-7 , Shift right to place mantissa for splitting 
STL: A, res_lm ; Save result low mantissa 

AND #07F00h, 8, ; Eliminate implied one 

STH A, res_hm ; Save result mantissa on stack 


: KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK 


* 

ke 

ay CONVERSION OF FLOATING POINT FORMAT - PACK 
*; Load sign. 
ke 
ke 
* 
* 


; Pack exponent. 
; Pack mantissa. 


p RRR RRR RRR KKK KK KKK EK KR KKK KKK KKK KKK KK EK KKK KKK KK KKK KKK KEK KEK RRR K KKK KEK KKK KK KK ERK 
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Example 6—11. Divide a Floating-Point Number by Another (Continued) 


LD res_sign,16,A 7 0000 000S 0000 0000 0000 0000 0000 0000 
ADD res_exp,16,A 7 0000 000S EEEE EEEE 0000 0000 0000 0000 
SFTL A,7 ; SEEE EEEE E000 0000 0000 0000 0000 0000 
DADD res_hm,A ; SEEE EEEE EMMM MMMM MMMM MMMM MMMM MMMM 


RK RK KKK KEK KKK KK KKK KKK KEK KK KK KKK KEK KKK KK KKK RK KKK KKK RK KKK KKK KEK KKK KK EK KE KK KKK KER 


ee CONTEXT RESTORE 
RI IR IRR IR IRR IR IRI IR II RIOR IIR IR IIR IR IIR IO I I II IO I Ak 


return_value 
opl_zero 


Fret 
* 


Ry RRR KKK KEK KK RK KKK KKK KK KKK KKK KKK KKK KKK KK KK KEK KK KK KKK KEK KR KKK KKK KEK KK KEK KKK KEK KKK 


+s OVERFLOW PROCESSING 
*; Push errno onto stack. 


*; Load accumulator with return value. 
Ee eRAKKEEKE RAKE EKER EK EEK ERE KE EEK ER ERE EEK ER KKK REKERERE KEKE REKE EEKEREKE EEK ERE KE 


overflow 
ST #2,err_no ; Load error no 
SAT A ; Result exponent = OFEh 
SUB #081h,16,A ; Result high mant = O7Fh 
BD return_value ; Branch delayed 
LD res_sign,16,B ; Load sign of result 
OR B,7,A ; Pack sign* 


Hg RRR KK RK KR RK KKK KK RK RK KK KKK KR KK KKK KR KK KKK KKK KEK KK KK KKK KEK KKK KKK KEK KK KEK KEK KEK KKK 


ke 
’ 


UNDERF LOW PROCESSING 
*; Push errno onto stack. 


*; Load accumulator with return value. 
EER RAEKK RAKE RAKE EKER ERE EEK ER EKE EEK ER ERE EEK ERK EEK EREKE EKER EKER EEK EREKE EEK ERE KE 


* 

underflow 
ST #1,err_no ; Load error no 
BD return_value ; Branch delayed 
sub A,A ; For underflow result = 0 
nop 

Kk. 

v 


KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKAKK KK KK KKK KK KK 
*; DIVIDE BY ZERO 
*; Push errno onto stack. 


*; Load accumulator with return value. 
Ee RRERERER REE ERE ER REE REE ERR ERR ER ERE KER EER EER ER BERR ER ER EER RE RRR EERE RRR BER 


op2_zero 
ST #3,err_no ; Load error no 
SAT A ; Result exponent = FEh 
; Result low mant = FFFFh 
LD opl_se,16,B ; Load sign and exponent of OP1 
AND #100h,16,B ; Mask to get sign of OP1 
OR B,7,A ; Pack sign 
BD return_value ; Branch delayed 
SUB #081h,16,A ; Result high mant = 7Fh 
NOP 
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6.6 Logical Operations 


DSP-application systems perform many logical operations, including bit manipu- 
lation and packing and unpacking data. A digital modem uses a scrambler and 
adescrambler to perform bit manipulation. The input bit stream is in a packed for- 
mat of 16 bits. Each word is unpacked into 16 words of 16-bit data, with the most 
significant bit (MSB) as the original input bit of each word. The unpack buffer con- 
tains either 8000h or 0000h, depending upon the bit in the original input-packed 
16-bit word. The following polynomial generates a scrambled output, where the 
® sign represents modulus 2 additions from the bitwise exclusive OR of the 
data values: 


Scrambler output = 1 @ x-18 @ x-23 


The same polynomial sequence in the descrambler section reproduces the 
original 16-bit input sequence. The output of the descrambler is a 16-bit word 
in packed format. 


Example 6-12. Pack/Unpack Data in the Scrambler/Descrambler of a Digital Modem 


; TEXAS INSTRUMENTS INCORPORATED 
-mmregs 
-asg AR1, UNPACK_BFFR 
-asg AR3,SCRAM_DATA_18 
-asg AR4,SCRAM DATA 23 
-asg AR2,DE_SCRAM_DATA_18 
-asg AR5,DE_SCRAM_DATA_23 
d_scram_bffr -usect "scrm_dat”, 30 
d_de_scram_bffr -usect "“dscrm_dt”, 30 
d_unpack_buffer -usect "scrm_var”,100 
d_input_bit -usect "scrm_var”,1 
d_pack_out -usect "“scrm_var”,1 
d_asm_count -usect “"scrm_var”,1 
K_BFFR_SIZE .set 24 
K_16 .set 16 
.def d_input_bit 
.def d_asm_count 


Functional Description 

This routine illustrates the pack and unpack of a data stream and 
also bit manipulation. A digital scrambler and descrambler does the 
bit manipulation and the input to the scrambler is in unpacked format 
and the output of the descrambler is in packed 16-bit word. 
scrambler_output = 1+x%*-18+x%*-23 

additions are modulus 2 additions or bitwise exclusive OR of data 


values. 
output. 


~sect 


The same polynomial is used to generate the descrambler 


"scramblr” 


scrambler_init: 


STM 
STM 
RPTZ 
STL 
STM 


#d_unpack_buffer, UNPACK_BFFR 
#d_scram_bffr, SCRAM_DATA 23 

A, #K_BFFR_SIZE 

A, *SCRAM_DATA_23+ 
#d_scram_bffr+K_BFFR_SIZE-1,SCRAM_DATA_23 
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Example 6—12. Pack/Unpack Data in the Scrambler/Descrambler of a Digital Modem 


STM 

STM 

STM 

LD 

ST 
scramler_task 
; the unpack 


; is either 1 


unpack_data: 
STM 
RPTB 


PORTR 
LD 


STL 


(Continued) 


#d_scram_bffrt+17,SCRAM DATA 18 


#d_input_bit,Dp 

#-K_16+1,d_asm_count 

data buffer has either 8000h or 
or 0 


K_16-1,BRC 
end_loop-1 


1h, d_input_bit 
d_input_bit,15,A 


A, *UNPACK_BFFR 


unpack_16_words 


scrambler: 
LD 
XOR 
XOR 
STL 


STL 
scramble_word 
descrambler: 

LD 

XOR 

XOR 

STL 


STL 


*SCRAM_DATA_18-%,A 
*SCRAM_DATA_23,A 
*UNPACK_BFFR,A 

A, *SCRAM_DATA_23-% 


A, *UNPACK_BFFR 
*DE_SCRAM_DATA_18-%,A 
*DE_SCRAM_DATA_23,A 
*UNPACK_BFFR,A 


A, *DE_SCRAM_DATA_23-% 


A, *UNPACK_BFFR 


de_scramble_word 
; ASM field shifts the descrambler output MSB into proper bit position 


, 
pack_data 
RSBX 
LD 
LD 
LD 
OR 
STL 
ADDM 
pack_word 
SSBX 
end_loop 
NOP 


NOP 
.end 
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SXM 
d_asm_count, ASM 
*UNPACK_BFFRt+,A 
A,ASM,A 
d_pack_out,A 

A, d_pack_out 
#1,d_asm_count 


SXM 


d_de_scram_bffr+K_BFFR_SIZE-1,DE_SCRAM DATA 23 
d_de_scram_bffr+17,DE_SCRAM _DATA_18 


0000h since the bit stream 


7 unpack the data into 16-bit 
; word 

; read the serial bit stream 
; mask thelower 15 bits 

; the MSB is the serial bit 

; stream 

; store the 16 bit word 


; A x*-184+x%*-23 

; A = A+x%*0 

; newest sample, for next 
; cycle it will be x(n-1) 
; store the scrambled data 


; A x*-184+x%*-23 

; A = A+x%*0 

; newest sample, for next 
7 cycle it will be x(n-1) 
; store the scrambled data 


; reset the SXM bit 


; start pack the data 


; enable SXM mode 


; GQummy instructions nothing 
; with the code 
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Application-Specific Examples 


This chapter shows examples of typical applications for the 54x. Since this DSP 
is widely used for speech coding and telecommunications, the applications 
show how some aspects of these functions are implemented. 


Topic Page 
7.1. Codebook Search for Excitation Signal in Speech Coding ........ 7-2 
7.2 Viterbi Algorithm for Channel Decoding ..............0eeeeeeees [7-5| 

7-1 
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7.1. Codebook Search for Excitation Signal in Speech Coding 


Acode-excited linear predictive (CELP) speech coder is widely used for applica- 
tions requiring speech coding with a bit rate under 16K bps. The speech coder 
uses a vector quantization technique from codebooks to an excitation signal. 
This excitation signal is applied to a linear predictive-coding (LPC) synthesis fil- 
ter. To obtain optimum code vectors from the codebooks, a codebook search 
is performed, which minimizes the mean-square error generated from weighted 
input speech and from the zero-input response of a synthesis filter. Figure 7—1 
shows a block diagram of a CELP-based speech coder. 


Figure 7-1. CELP-Based Speech Coder 


Input speech »| Weighting 
filter 


Codebook 


To locate an optimum code vector, the codebook search uses Equation 7-1 
to minimize the mean-square error. 


Equation 7-1. Optimum Code Vector Localization 


E, = Y{p(n) - y,g, (n)}" N : Subframe 


The variable p(n) is the weighted input speech, gj(n) is the zero-input response 
of the synthesis filter, and ; is the gain of the codebook. 


The cross-correlation (c;) of p(n) and gj(n) is represented by Equation 7—2. The 
energy (Gj) of gj(n) is represented by Equation 7-3. 
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Codebook Search for Excitation Signal in Speech Coding 


Equation 7-2. Cross Correlation Variable (c;) 


ci = Dy gi* p(n) 
i=0 


Equation 7—3. Energy Variable (G;) 


N+ 
G = gi? 


i=0 


Equation 7—1 is minimized by maximizing c;2 / G;. Therefore, assuming that a 
code vector with i = opt is optimal, Equation 7—4 is always met for any i. The 
codebook search routine evaluates this equation for each code vector and 
finds the optimum one. 


Equation 7—4. Optimal Code Vector Condition 
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Example 7-1 shows the implementation algorithm for codebook search on 
54x. The square (SQUR), multiply (MPYA), and conditional store (SRCCD, 
STRCD, SACCD) instructions are used to minimize the execution cycles. AR5 
points to cj and AR2 points to Gj. AR3 points to the locations of Gop; and Copt?- 
The value of i(opt) is stored at the location addressed by AR4. 
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Example 7-1. Codebook Search 


7-4 


SEARCH: 


Srh_End: 


PRELIMINARY 


-title ”CODEBOOK SEARCH” 


-mmregs 

text 

STM #C,AR5 ;Set C(i) address 

STM #G,AR2 ;Set G(i) address 

STM #OPT, AR3 ;Set OPT address 

STM #IOPT, AR4 ;Set IOPT address 

ST #0, *AR4 ;Initialize lag 

ST #1, *AR3+ ;Initialize Gopt 

ST #0, *AR3- ; Initialize C2opt 

STM #N-1, BRC 

RPTB Srh_End-1 

SQUR *AR5+,A ;A = C(i) * C(i) 

MPYA *AR3+ 7B = C(i)*2 * Gopt 

MAS *AR2+, *AR3-,B 7B = C(i)*2 * Gopt —- 
7G(i) * C2opt,T = G(i) 

SRCCD *AR4,BGEQ ,;if(B >= 0) then 
;iopt = BRC 

STRCD *AR3+,BGEQ ;if(B >= 0) then 
7Gopt = T 

SACCD A, *AR3-,BGEQ ;if(B >= 0) then 
7C2Zopt = A NOP 

NOP ;To save current BCR 
7*AR4 —> optimal index 

RET 

.end 
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7.2 Viterbi Algorithm for Channel Decoding 


Convolutional encoding with the Viterbi decoding algorithm is widely used in 
telecommunication systems for error control coding. The Viterbi algorithm 
requires a computationally intensive routine with many add-compare-select 
(ACS) iterations. The ’54x can perform fast ACS operations because of dedi- 
cated hardware andinstructions that support the Viterbi algorithm on chip. This 
implementation allows the channel decoder and the equalizer in communica- 
tion systems to be used efficiently. 


In the global system for mobile communications (GSM) cellular radio, the poly- 
nomials in Equation 7—5 are used for convolutional encoding. 


Equation 7-5. Polynomials for Convolutional Encoding 
G1(D) = 1 + D34 D4 G2(D) = 1+D+D3+D4 


This convolutional encoding can be represented in a trellis diagram, which 
forms a butterfly structure as shown in Figure 7—2. The trellis diagram illus- 
trates all possible transformations of convolutional encoding from one state to 
another, along with their corresponding path states. There are 16 states, or 
eight butterflies, in every symbol time interval. Two branches are input to each 
state. Decoding the convolutional code involves finding the optimal path by 
iteratively selecting possible paths in each state through a predetermined 
number of symbol time intervals. Two path metrics are calculated by adding 
branch metrics to two old-state path metrics and the path metric (J) for the new 
state is selected from these two path metrics. 


Figure 7-2. Butterfly Structure of the Trellis Diagram 


Old state New state 


2*J 


a*s+1 
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Equation 7-6 defines a branch metric. 


Equation 7-6. Branch Metric 


M = SD(2*i) * B(J,0) + SD(2*i+1) * B(J,1) 


SD(2*i) is the first symbol that represents a soft-decision input and SD(2*i+1) 
is the second symbol. B(J,0) and B(J,1) correspond to the code generated by 
the convolutional encoder as shown in Table 7-1. 


Table 7-1. Code Generated by the Convolutional Encoder 
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J BW) BU) 
_ — = —# 
1 -1 A 

2 1 1 

3 1 -1 

4 1 -1 

5 1 1 

6 1 = 

7 -1 1 


The 54x can compute a butterfly quickly by setting the ALU to dual 16-bit mode. 
To determine the new path metric (J), two possible path metrics from 2*J and 
2*J+1 are calculated in parallel with branch metrics (M and —M) using the 
DADST instruction. The path metrics are compared by the CMPS instruction. 


To calculate the new path metric (J+8), the DSADT instruction calculates two 
possible path metrics using branch metrics and old path metrics stored in the 
upper half and lower half of the accumulator. The CMPS instruction determines 
the new path metric. 


The CMPS instruction compares the upper word and the lower word of the 
accumulator and stores the larger value in memory. The 16-bit transition regis- 
ter (TRN) is updated with every comparison so you can track the selected path 
metric. The TRN contents must be stored in memory locations after proces- 
sing each symbol time interval. The back-track routine uses the information in 
memory locations to find the optimal path. 
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Example 7-2 shows the Viterbi butterfly macro. A branch metric value is 
stored in T before calling the macro. During every butterfly cycle, two macros 
prevent T from receiving opposite sign values of the branch metrics. 
Figure 7-38 illustrates pointer management and the storage scheme for the 
path metrics used in Example 7-2. 


In one symbol time interval, eight butterflies are calculated for the next 16 new 
states. This operation repeats over a number of symbol time intervals. At the 
end of the sequence of time intervals, the back-track routine is performed to 
find the optimal path out of the 16 paths calculated. This path represents the 
bit sequence to be decoded. 


Figure 7-3. Pointer Management and Storage Scheme for Path Metrics 
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Pointer Location (relative) 
AR5 ——» o 
Metrics 
R244 Old state 
15 — 
AR4 ——»> 16 
Metrics 
J 
24 New state 
AR3 ——»> 
Metrics 
J+8 
317 
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Example 7-2. Viterbi Operator for Channel Coding 


VITRBF . MACRO ; 
DADST *AR5,A ;A = OLD_M(2*J)+T//OLD_(2*J+1)-T 
DSADT *AR5+,B 7B = OLD_M(2*J)-T//OLD_(2*J+1)+T 
CMPS A, *AR4+ ;NEW_M(J) = MAX (A_HIGH, A_LOW) 
; TRN<<1, TRN(0,0) = TC 
CMPS’ B, *AR3+ ;NEW_M(J+8) = MAX (B_HIGH, B_LOW) 
, TRN<<1, TRN(O,) = TC 
. ENDM 
VITRBR . MACRO ; 
DSADT *AR5,A 7A = OLD_M(2*J)-T//OLD_(2*J+1)+T 
DADST *AR5+,B ;B = OLD_M(2*J)+T//OLD_(2*J+1)-T 
CMPS A, *AR4+ ;NEW_M(J) = MAX (A_HIGH, A_LOW) 
; TRN<<1, TRN(0,0) = TC 
CMPS' B, *AR3+ ;NEW_M(J+8) = MAX (B_HIGH, B_LOW) 
; TRN<<1, TRN(0O,) = TC 
. ENDM 


78 PRELIMINARY 


PRELIMINARY 


PRELIMINARY 


Chapter 8 


Bootloader 


The bootloader lets you load and execute programs received from a host 
processor, EPROMs, or other standard memory devices. The ’54x devices 
provide different ways to download the code to accommodate various system 
requirements. Some applications use a serial interface. If the code exists in 
external ROM, a parallel interface is appropriate. This chapter uses the 542 
as a reference platform for HPI bootloader option platform and ’541 for other 
bootloader options available on ’54x. 


Topic Page 
Si) (Boot Mode:Seloctlonicseecwacss ses eranas seoceee erase cce cance B-2| 
8.2 Host Port Interface (HPI) Boot Loading Sequence ................ 
8:3, 5 16-Bite-Bit Parallel Boots... 1. sncsce roan shies ceva ste cee ns B-5| 
6.4 VOBOOE i. ccceresnsocrcwnessseundenscwssersnetesessean ns 
85. Standard’Sonal Bool jas .ccesie strc ue ss patna eee eataane 
GH WETIIEC cocosconnopspcosnonsssunpooeendousoDonsooRonBHoone 8-12 
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8.1 Boot Mode Selection 


8-2 


Execution begins at location FF80h of the on-chip ROM if the MP/MC pin of 
the 54x is sampled low during a hardware reset. This location contains a 
branch instruction to start the bootloader program that is factory-programmed 
in ROM. This program sets up the CPU status registers before initiating the 
bootload. Interrupts are globally disabled (INTM = 1) and internal dual-access 
and single-access RAMs are mapped into program/data space (OVLY = 1). 
Seven wait states are initialized for all the program and data spaces. The size 
of the external memory bank is set to 4K words. It uses one latency cycle when 
switching between program and data space. 


The boot routine reads the I/O port address OFFFFh by driving the I/O strobe 
(IS) signal low. The lower eight bits of the word read from I/O port address 
OFFFFh specify the mode of transfer. The boot routine selection (BRS) word 
determines the boot mode. The BRS word uses a source (SRC) field when in 
parallel EPROM mode and an entry address (ADDR) field when using a warm 
boot (see Section 8.6, Warm Boot on page 8-12). The six least significant bits 
and the configuration of CLKX and FSX pins determine whether to use the 8- 
or 16-bit bootload serial boot option. The BRS word also determines the 8- or 
16-bit parallel I/O mode. 


The host port interface (HPI) uses interrupt 2 to bootload (INT2). If INT2 is not 
latched, the boot routine skips HP! boot mode (see Section 8.2, Host Port Inter- 
face (HPI) Bootloading Sequence on page 8-4). It reads the lower eight bits 
from the I/O address, OFFFFh, to determine the boot mode. Figure 8-1 illus- 
trates the boot mode selection process. 
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Figure 8—1. Boot Mode Selection Process 


[ — Seal oot modes 


BRS = Ye Configure BSP for 8-bit 
x00 00002 (FSX/CLKX as output) 
; serial port boot mode 
No 


Configure BSP for 16-bit 
(FSX/CLKX as output) 
serial port boot mode 


Initialization 


[HPI boot mode | 


Begin execution @ 
hpiram 


| ee 


Read BRS word from I/O 
@ address OFFFFh 


BRS = 
XXxx Xx00? 
No 


I/O boot mode 


N S Configure BSP for 8-bit | 
ae bee (FSX/CLKX as input) 
— / Y serial port boot mode 


8-bit parallel I/O 16-bit parallel I/O Configure BSP for 16-bit 
mode mode (FSX/CLKX as input) 
serial port boot mode 


Parallel EPROM Boot Modes 


epee Yes 8-bit parallel Configure TDM for 8-bit 
" S EPROM mode (FSX/CLKX as output) 
RAN serial port boot mode 
gas Configure TDM for 16-bit 
= P 16-bit parallel (FSX/CLKX as output) 
XXX XX003 EPROM mode serial port boot mode 
__ No 


Warm boot mode - 
16-bit TDM standard 


Warm boot mode 


(BRS = xxxx xx11) 


(FSX/CLKX as input) 
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8.2 Host Port Interface (HPI) Boot Loading Sequence 
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The HPI is an 8-bit parallel port that interfaces a host processor to the 542. 
The host processor and the ’542 exchange information via on-chip, shared 
memory. The host interrupts the CPU by writing to the HPI control register 
(HPIC). The CPU interrupts the host by asserting the host interrupt (HINT) sig- 
nal. The host can acknowledge or clear this signal. The signal determines 
whether the HPI boot option is selected by asserting HINT low. This signal is 
tied to the external interrupt INT2 input pin if HPI boot mode is selected. 
(Instead of tying HINT to INT2, you can send a valid interrupt to the INT2 input 
pin within 30 CLOCKOUT cycles after the 542 fetches the reset vector.) 


Asserting HINT low sets the corresponding interrupt flag register (IFR) bit. The 
bootloader waits for 20 CLKOUT cycles after asserting HINT and reads bit 2 
of IFR. If the bit is set (indicating that INT2 is recognized), the bootloader trans- 
fers control to the start address of the on-chip HPI RAM (1000h in program 
space) and executes code from that point. If bit 2 of the IFR is not set, the boot 
routine skips HPI boot mode and reads BRS from the I/O address, OFFFFh, 
in I/O space. The lower eight bits of this word specify the mode of transfer. The 
bootloader ignores the rest of the bits. 


If HPI boot mode is selected, the host must download the code to on-chip HPI 
RAM before the HPI brings the device out of reset. The bootloader keeps HPI 
in shared-access mode (SMODE = 1) during the entire operation. Once HINT 
is asserted low by the bootloader, it stays low until a host controller clears it 
by writing to HPIC. 
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PRELIMINARY 16-Bit/8-Bit Parallel Boot 


8.3 16-Bit/8-Bit Parallel Boot 


The parallel boot option is used when the code is stored in ERROMs (8 or 16 
bits wide). The code is transferred from data to program memory. The six most 
significant bits (MSBs) of the source address are specified by the SRC field of 
the BRS word, as shown in Figure 8-2. 


Figure 8-2. 16-Bit EPROM Address Defined by SRC Field 


15 10 9 8 7 6 5 4 3 2 1 0 
| SRC Pote Tote foftefofte fo fe] 
Note: SRC = Source address 


If 16-bit parallel mode is selected, data is read in 16-bit words from the source 
address and incremented by 1 after every read operation. The destination 
address and the length of the code are specified by the first two 16-bit words. 
The length is defined as: 


Length = number of 16-bit words to be transferred — 1 


The number of 16-bit words specified by length does not include the first two 
words read (destination and length parameters), starting from the source ad- 
dress. This is shown in Figure 8-3. The code is transferred from data memory 
(source address) to program memory (destination address). At least a 
10-cycle delay occurs between a read from EPROM and awrite to the destina- 
tion address. This ensures that if the destination is external memory, like fast 
SRAM, there is enough time to turn off the source memory (EPROM) before 
the write operation is performed. After the code is transferred to program 
memory, the ‘541 branches to the destination address. This occurs for both for 
16-bit and 8-bit parallel boot options. 


Figure 8-3. Data Read for a 16-Bit Parallel Boot 


15 0 


Destination 16 


Length 4g =N-1 


Code Word (1) 16 


Code word (N) 16 


Notes: 1) Destination 1g = 16-bit destination 
2) Length 16 = 16-bit word that specifies the length of the code (N) that follows it 
3) Code word (N) 1g =N 16-bit words to be transferred 
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16-Bit/8-Bit Parallel Boot PRELIMINARY 


If the 8-bit parallel boot option is selected (see Figure 8—4), two consecutive 
memory locations (starting at the source address) are read to make one 16-bit 
word. The high-order byte must be followed by the low-order byte. Data is read 
from the lower eight data lines, ignoring the upper byte on the data bus. The 
destination address is a 16-bit word that constitutes the address in program 
space where the boot code is transferred. The length is defined as: 


Length = number of 16-bit words to be transferred — 1 


= (number of bytes to be transferred / 2) —1 


Figure 8-4. Data Read During 8-Bit Parallel Boot 


7 0 
Destinationp 


Destination, 
Length, = N-1 
Length; = N -1 
Code word (1) 
Code word (1); 


Code word (N)p, 
Code word (N); 


Notes: 1) Destinationy and Destination, represent high and low bytes of destination ad- 
dress 


2) Lengthp and Length represent high and low bytes of a 16-bit word that speci- 
fies the length N of the code that follows it. 


3) Nh and N| bytes constitute N words to be transferred. 
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PRELIMINARY 16-Bit/8-Bit Parallel Boot 


Figure 8—5 shows the parallel boot sequence, both for the 16- and 8-bit op- 
tions. 


Figure 8-5. 8-Bit/16-Bit Parallel Boot 


Start 


Load and save 16-bit 
EPROM address (SRC) 8-bit read 

DA, code length, 
code word from SRC 


Read DA 
Read HB from SRC 


Read & copy code 
length to BRC 


Read LB from SRC 


tH 


Branch to DA 16-bit read 
DA, code length, 


Read code word from code word from SRC 
SRC and save code Start executing code 


word in PM Read 16-bit word from 
SRC 


Increment SRC 


Increment PM 


Decrement code length 


{itt 
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//O Boot 


8.4 


I/O Boot 


PRELIMINARY 


The I/O boot mode provides asynchronous transfer code from I/O address Oh 
to internal/external program memory. Each word can be 16 or 8 bits long. The 
‘541 communicates with external devices using the BIO and XF handshake 
lines. The handshake protocol shown in Figure 8-6 is required to successfully 
transfer words from I/O address Oh. 


Figure 8-6. Handshake Protocol 


8-8 


BIO 


XF 


Host request 
data transmit 


' °541 ready to ' Host data valid : 


B41 : 
seein ' 4 -word transfer ' acknowledges Next word transfer 
' data received ! 


Figure 8-6 shows a data transfer initiated by the host, which drives the BIO 
pin low. When BIO goes low, the 541 inputs data from I/O address Oh, drives 
the XF pin high to indicate to the host that the data has been received, and 
writes the input data to the destination address. The 541 then waits for the BIO 
pin to go high before driving the XF pin low. The low status of the XF line can 
be polled by the host for the next data transfer. 


If 8-bit transfer mode is selected, the lower eight data lines are read from I/O 
address, Oh. The upper byte on the data bus is ignored. The 541 reads two 
8-bit words to form a 16-bit word. The low byte of each 16-bit word must follow 
the high byte. Figure 8—7 shows the I/O boot sequence, both for the 16- and 
8-bit options. 


For both 8- and 16-bit I/O, the first two 16-bit words received by the 541 must 
be the destination and length of the code, respectively. A minimum delay of 10 
clock cycles occurs between the rising edge of XF and the write to the destina- 
tion address. This allows the processor sufficient time to turn off its data buffers 
before the ’541 initiates the write operation if the destination is external 
memory. The 541 accesses the external bus only when XF is high. 
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Figure 8-7. 8-Bit/16-Bit I/O Boot Mode 


8-bit/16-bit parallel |/O mode 


Handshake with host 


Read D/A from I/O Oh 


Handshake with host 


Read code length from 
1/0 Oh 


> 
Handshake with host 
Read D/A from I/O Oh 


Transfer data from DM 
into PM and increment 
PM 


Code 
length 


0 
No Branch to DA 
Decrement code length 


Start executing code 
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8-bit read 
DA, code length, 
code word from I/O Oh 


Handshake with host 


Read HB from I/O 0h 

Handshake with host 

Read LB from I/O 0h 
Vv 


16-bit handshake & read 
DA, code length, 
code word from I/O Oh 


Handshake with host 


Read 16-bit word from 
/O Oh 


Bootloader 


//O Boot 


Standard Serial Boot 
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8.5 Standard Serial Boot 
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The ’541 serial boot option can use either the buffered serial port (BSP) or 
time-division multiplexed (TDM) serial port in standard mode during booting. 
Eight modes are available for the serial boot option (see Figure 8-1, Boot 
Mode Selection Process, on page 8-3). The word length (8- or 16-bit) and the 
configuration of the CLKX/FSX pins determines the correct mode to use. For 
8-bit operation, set the following bits: 


CV Receive reset signal, RRST (to take serial port 0 out of reset) 

_j Transmit mode bit, TXM 

L1 Clock mode bit, MCM (so that CLKX is driven by an on-chip clock source) 
a) 


Frame sync mode bit, FSM (frame sync pulses must be supplied external- 
ly on the FSR pin) 


_j Format bit, FO (so that data is transferred in an 8-bit length) 


This translates to a value of 3Ch to the SPC register, which puts the serial port 
in reset. The RRST and XRST of SPC register are set to 1 to take the serial 
port out of reset for configuring CLKX/FSX as output pins in 8-bit mode. Then 
avalue of FCh is written to the SPC register for 16-bit mode, FO =0. This writes 
38h to the SPC register to put the serial port in reset. It also writes a 1 to both 
RRST and XRST to pull the serial port register out of reset and configure 
CLKX/FSX as output pins, and a value of F8h is written to the SPC register. 


To drive the CLKX and FSX pins as inputs, the MCM and TXM bits are disabled 
and the serial port is configured for 8- and 16-bit mode options. The external 
flag, XF, sends a signal that the ‘541 is ready to respond to and receive from 
the serial port. The XF flag is set high at reset and is driven low to initiate recep- 
tion. No frame sync pulses can appear on FSR before XF goes low. For the 
buffered serial port, the BSP control extension register (BSPCE) is initialized 
to set the CLKX to 1/4 of the CLKOUT signal. Figure 8—8 shows the serial boot 
sequence. 
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PRELIMINARY 


Figure 8-8. Serial Boot Mode 


Serial boot mode 


Configure SPC register to 
put in reset and then pull 
out of reset. Configure 
BSPCE register 


Read DA from SP 
Read code length 
from SP 


Read code word from 
SP and save the code 
word in DM 


Transfer data from DM 
into PM and increment 
PM 


No 
Decrement code length 
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Branch to DA 


Start executing code 


Standard Serial Boot 


8-bit read 
DA, code length, 
code word from SP 


Read HB from SP 
Read LB from SP 


16-bit read 
DA, code length, 
code word from SP 


Read 16-bit word 
from SP 


Bootloader 
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Warm Boot PRELIMINARY 


8.6 Warm Boot 


The ’541 transfers control to the entry address if the warm boot option is specified. 
The warm boot option can be used after a warm device reset or if the program 
has already been transferred to internal or external memory by other means, such 
as a direct memory access (DMA). For a warm boot, the six MSBs at the entry 
point of the code are specified by the SRC or ADDR fields of the BRS word, as 
shown in Figure 8-9. 


Figure 8-9. Warm Boot Address Specified in BRS Word 
15 10 9 8 a 6 5 4 3 2 1 0 
ee ee eee re ee 


Note: ADDR = 6-bit page address 


Example 8-1. Warm Boot Option 


KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK 


* FILENAME : BOOTC542.ASM 
* 


* This code segment sets up and executes bootloader code based upon 
i data saved in data memory 
KKKKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKKKKKKKKKKKKKKKKKKKKKKKKK 

wertle "“bootc542” 
KKEKKKKKKKKKKKKKKKKK KK KKK KKK KKKKKKKKKKKKKKKKKKKKKKKKKKKK KKK 
* symbol definitions 
KKKKKKKKKKKKKKKKKKKKKK KKK KKK KKKKKKKKKKKKKKKKKKKKKKKKK KK KKK 

-Immregs 

-mnolist 

def boot 

def endboot 

def bootend 

.deft bsprcv_isr 

.def tdmrcv_isr 

def dest 

def src 

.deft ingth 

def s8word 

.def hbyte 

def state 
* .ref boota ; reserved for ROM Code customer 
boota -set 0184h ; Arbitrary ref (for USR bootcode) 
* Conditional Assembly Flags 
C542 -set 1 
* 
pad -set Oh ; port address Oh for i/o boot load 
brs -set 60h 7 boot routine select (configuration word) 
bootmode -set 61h ; boot mode extracted from brs 
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Example 8-1. Warm Boot Option (Continued) 


s8word .set 62h ; concatenator for 8-bit serial load 
hbyte .set 63h ; high byte of 8-bit serial word 
p8word .set 64h 7 concatenator for 8-bit memory load 
sre .set 65h ; source address 
dest .set 66h ; destination address (dmov from above) 
ingth .set 67h 7 code length 
temp .set 68h ; temporary register 
state .set 69h ; serial I/O state vector 
nmintv .set 6ah 7 non-maskable interrupt vector 
x Bit equates 
b0 .set 00h 
b4 .set 04h 
b8 .set 08h 
be .set Och 
b10 .set 010h 
b14 .set 014h 
b20 .set 020h 
b24 .set 024h 
b30 .set 030h 
b34 .set 034h 
hpiram -set 01000h 
int2msk .set 004h , INT2_ bit position on IFR 
KK KKK KKK KK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKKKKKKKKKK KKK KKK 
ss main program starts here 
KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KK KKK KKKKKKKKKKK KKK 
.sect "pootload” 
boot 
ssbx intm ; Gisable all interrupts 
ld #0, dp 
stm #boota, @nmintv 
orm #03b00h, @st1l ; xf=1, mh=1, intm=1, ovm=1, sxm=1 
orm #020h, @pmst ; ovly=1 
stm #O07££fh, swwsr , 7 wait states for P_,D_, and I_ spaces 
stm #0f£800h, bscr ; full bank switching 
kK ee kK ke kK kX KK KK KK KK kK KK KK KK KK KH KK kK KK KK KK KR KR *K 
fs HPI boot, simply branch to HPI RAM 
i a a a i a a a a a a a a, a a a a a a a a a a a, a a a ed 
ot C542 
stm #01010b, hpic ; Send HINT_ low 
rpet #18 7 wait 20 clockout cycles 
nop 
bite @ifr, #int2msk , Check if INT2_ flag is set 
; ThisTEST MUST BE >= 30 cycles from boot? 
stm #int2msk, ifr ; Clear INT2_ bit in ifr if INT2_ latched 
bed endboot, tc ; If yes, branch to HPI RAM 
st #hpiram, @dest 
-endif 
kK * kk kK * F RK KK & *F KK kK KF K KK KE KK KH Kk KK RK KF KK & * 
x Read Configuration Byte 
Ke 8 kK ke e FF Re Ke ek OR Ke eR OK KK RK wk we KR KR BF Hk KK K& 
portr #0ffffh, @brs + brs <-- boot load configuration word 
1d @brs, 8, a 7 get boot value in acc AL 
and #O0fcO00Oh, a ; throw away 2 LSBs 
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Example 8-1. Warm Boot Option (Continued) 

stl a, @src ; Save as source address 

ldu @brs, a ; determine bootload method 

and #3, a ; if 2 LSBs == 00 

be ser_io, aeq ; use serial or parallel I/O 

sub #2, a ; if 2 LSBs == 01 

be par08, alt ; load from 8-bit memory 

; if 2 LSBs == 10 
be parl6, aeq : load from 16-bit memory 


7 
i a a a i a a a a a a a a a a a ad 


* Warm-boot, 


KK KK KK KK KK KR KK KK KK 


warmboot 
delay @src ; 
endboot 
ld @dest, a 7 
bacc a 


i a a a i a a a a a a a a a a a ad 
* 


i i a a a a a a a a a a a a a a ad 


parl6 
mvdk @src, arl 7 
ld *arlt+, a ; 
stl a, @dest ; 
stlm a, ar2 ; 
ld *arlt+t, a 7 
stlm a, bre ; 
nop , 
rptb xfr1l6-1 
mvdk *arlt+, ar3 7 
1ldm ar2, a i 
add #1, a ; 
stim a, ar2 - 
sub #1, a ; 
writa @ar3 ; 
xfrl6 
b endboot 


Kk kK kK kK KK KK KK KK KR KK KKK 
* Bootload from 8-bit memory, 
kK kK kK kK KK KK KK KKK KKK KK 


par0s8 
mvdk @src, arl 7 
ld *arlt+, 8, a : 
mvdk *arlt+, ar3 7 
andm #0ffh, @ar3 ; 
or @ar3, a ; 
stl a, @dest ; 
stlm a, ar2 7 
lid *arlt+, 8, a ; 
mvdk *arlt+, ar3 ; 
andm #0ffh, @ar3 : 
or @ar3, a ° 
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simply branch to source address 


Bootload from 16-bit memory 


else 2 LSBs als 


KK eK eK KK Ke KK KO 


* 
kk & BK # F & ® & ® * 


dest <-- src 


branch to destination address 


kK kK kK eK kK Kk OK 
* 


kK kK kK eK kK Kk OK 


arl points at source memory (Data) 
load accumulator A with destination 
save to scratchpad 

current destination in block repeat 
get the length 

update block repeat counter register 
bre latency 


read object data 

get previous destination address 
these instructions also 

serve the purpose of inserting 
10 cycles b/w read & write 

write object data to destination 


kk & K & ® & KK & ® * 


MS byte first 


kK kK eK eK KK KK KK OK 


* 


arl points at source memory (Data) 
load accumulator A with destination 
ars <-- junkbyte.low byte 

ar3 <-- low byte 

acc A <-- high byte.low byte 

save to scratchpad for endboot 
ar2 points at destination 

get number of 16-bit words 

ar3 <-- junkbyte.low byte 

ar3 <-- low byte 

acc A <-- high byte.low byte 
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Example 8-1. Warm Boot Option (Continued) 


stim a, bre ; update block repeat counter register 
nop , bre update latency 
rptb eloop4-1 
ld *arl+, 8, a , acc A <-- high byte 
mvdk *arlt+, ar3 7 ax3 <-- junkbyte.low byte 
andm #0ffh, @ar3 ; axr3 <-- low byte 
or @ar3, a , acc A <-—- high byte.low byte 
stl a, @p8word 
1ldm ar2, a ; acc A <-—- destination address 
nop ; 10 cycles b/w read & write 
writa @p8word 7 write object data to destination 
add #1, a 
stlm a, ar2 7 update destination address 
eloop4 
b endboot 
i i a a a i i a, a i i a a a, a i a a a en a a a a ae 2 
ser_io 
ld @brs, a 
and #0fh, a ; clear except lower 4 bits 
stl a, @bootmode 7 save only boot mode 
cmpm @bootmode, #b8 ; test for io boot 8 
be pasync08, tc ; if set, perform parallel I/O bootload-8 
cmpm @bootmode, #bc ; test bit #3 of configuration word 
be pasyncl6é, tc ; if set, perform parallel I/O bootload-16 
& * Ke ® KK RK K kK RK * & KK &® KK KX KK € KK Ke X 
* Bootload from serial port * 
BOR OR OR Boe OR BE BR ek RR OR OR ke oe OK KK ee & KF 
ser 
ld @brs, a 
and #3fh, a ; Clear except lower 6 bits 
stl a, @bootmode 7 save only boot mode 
cmpm @bhootmode, #b0 ; test bit #0 of bootmode word 
bed bspO8int, tc ; if set, then 8-bit serial with int BCLKX, BFSX 
andm #O0ff01h, @spc ; clear bits 1-7 
cmpm @bootmode, #b4 ; test bit #2 of bootmode word 
be bsploint, tc ; if set, then 16-bit serial with int BCLKX, BFSX 
cmpm @bootmode, #b10 ; test bit #4 of bootmode word 
be bspO08ext, tc ; if set, then 8-bit serial with ext BCLKX, BFSX 
cmpm @bootmode, #b14 ; test bit #4&2 of bootmode word 
be bspl6oext, tc ; if set, then 16-bit serial with ext BCLKX, BFSX 
cmpm @bootmode, #b20 ; test bit #5 of bootmode word 
bed tdm0sint, te ; if set, then 8-bit serial with int TCLKX, TFSX 
andm #O0ff00h, @tspc ; clear bits 0-7 
cmpm @bootmode, #b24 ; test bit #5&2 of bootmode word 
be tdml6int, te ; if set, then 16-bit serial with int TCLKX, TFSX 
cmpm @bootmode, #b30 ; test bit #5&4 of bootmode word 
be tdm08ext, tc ; if set, then 8-bit serial with ext TCLKX, TFSX 
cmpm @bootmode, #b34 ; test bit #5&4&2 of bootmode word 
be tdml6éext, tc ; if set, then 16-bit serial with ext TCLKX, TFSX 
b bootend 
we we Ke Re RB KR KK KR K RK Re KK KR KR KK KEK KR KK KK KR KK KH KK KR K KR K 
* Bootload from Buffered Serial Port (BSP) * 


OK RB 8 oR RK Ee KK KK KF KK KK KK RK KF RK K KK KR KK KK KF 
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Warm Boot Option (Continued) 


; configure sport and put in reset 


PRELIMINARY 


; CLKKV=3, FSP=CLKP=FE=F IG=PCM=BXE=HLTX=BRE=HLTR=0 


; 16-bir service routine addr 


; configure sport and put in reset 


; CLKKV=3, FSP=CLKP=FE=F IG=PCM=BXE=HLTX=BRE=HLTR=0 


; 88-bit service routine addr 


; configure sport and put in reset 


; CLKKV=3, FSP=CLKP=FE=F IG=PCM=BXE=HLTX=BRE=HLTR=0 


; 16-bit service routine addr 


; configure sport and put in reset 


7; CLKKV=3, FSP=CLKP=FE=F IG=PCM=BXE=HLTX=BRE=HLTR=0 


; 88-bit service routine addr 


; take sport out of reset 
; Signal ready-to-receiv 


; Clear flag 


; begin receive data routine 
; if rrdy = 1 


7 vector to the sport receive routine 
(16-bit serial mode) 


; get destination addr into arl 
; save destination addr 


; next service routine 


serial mode) 


7 get length in words 
; add destination address 
; Save end address 


; next service routine 
; get the destination address 


; check for RAM full condition 
; Spin if transfer not complete 


7 write object word at destination addr 


; increment destination addr 


Warm Boot 
Example 8-1. 
bspl6ext 

orm #0008h, @spc 

stm #0003h, spce 

bd bspselfl 

st #bspaddl6, @state 
bsp08ext 

orm #000ch, @spc 

stm #0003h, spce 

bd bspselfl 

st #bspadd8_1, @state 
bspl6int 

orm #0038h, @spc 

stm #0003h, spce 

bd bspselfl 

s #bspaddl6, @state 
bspO08int 

orm #003ch, @spc 

stm #0003h, spce 

st #bspadd8_1, @state 
bspselfl 

orm #0080h, @spc 

rsbx xf 
* Poll for receive data ready 
bspin 

rsbx tc 
bspinn 

bcd bspinn, ntc 

bitft @spc, #0400h 
bsprcev_isr 

ld @state, a 

bacc a 
* Load destination address 
bspadd16 

mvdk @drr, arl 

mvkd arl, @dest 

bd bspin 

st #bsplenl6, @state 
* Load end address (16-bit 
bsplenl6 

1ldm arr, 2 

add @dest, a 

stlm a, arO 

bd bspin 

st #bspisrl6, @state 
*  Bootload 16-bit serial data 
bspisr16 

ldu @arl, a 

cmpr eq, arl 

bcd bspin, ntc 

writa @drr 

mar *arlt+ 

b endboot 
8-16 
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Example 8-1. Warm Boot Option (Continued) 


* Load destination address (8-bit serial mode) 


bspadd8_1 
ld @drr, 8, a 7 acc A <-—- junkbyte.high byte 
and #0ff00h, a ; acc A <-—- high.byte 
stl a, @hbyte 7 save high byte 
bd bspin 
st #bspadd8_2, @state ; next service routine 
bspadd8_2 
ldu @drr, a 7, acc A <-- junkbyte.low byte 
and #0ffh, a ; acc A <-—- low byte 
or @hbyte, a 7 acc A <-—- high byte.low byte 
stlm a, arl ; save destination address 
stl a, @dest 
bd bspin 
st #bsplen8_1, @state y; next service routine 
* Load end address (8-bit serial mode) 
bsplen8_1 
id @drr, 8, a , acc A <-- high byte 
stl a, @hbyte 7 save high byte 
bd bspin 
st #osplen8_2, @state ; next service routine 
bsplen8_2 
ldu @drr, a 7, acc A <-- junkbyte.low byte 
and #0ffh, a ; acc A <-—- low byte 
or @hbyte, a 7 acc A <-—- high byte.low byte 
add @dest, a ; add destination address 
stlm a, ar0 ; save end address 
bd bspin 
st #bspisr8_1, @stat ; next service routine 
*  Bootload 8-bit serial data 
bspisr8_1 
ld @drr, 8, a , acc A <-- high byte 
stl a, @hbyte 7 save high byte 
bd bspin 
st #bspisr8_2, @state ; next service routine 
bspisr8_2 
ldu @drr, a , acc A <-—- junkbyte.low byte 
and #0ffh, a ; acc A <-—- low byte 
or @hbyte, a 7 acc A <-- high byte.low byte 
stl a, @s8word ; save 16-bit word 
1ldm arl, a , get destination addr 
cmpr eq, arl ; check for RAM full condition 
bed endboot, tc ; exit if RAM full 
writa @s8word 7 copy word to pmem[<arl>] 
mar *arlt+ 
bd bspin 
st #bspisr8_1, @state ; next service routine 
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Example 8-1. Warm Boot Option (Continued) 


i a a a a a a a a i a a a a a a a a a a a a a a a a a ed 


*  Bootload from TDM Serial Port (TDM) * 
Se a a a a a a a a, a a a a a a a a a a a, a a Se a a a a ad 
tdml6ext 
orm #0008h, @tspc ; configure sport and put in reset 
bd tdmselfl 
St #tdmadd1l6, @state ; 16-bir service routine addr 
tdm08ext 
orm #000ch, @tspc ; configure sport and put in reset 
bd tdmselfl 
St #tdmadd8_1, @state ; 8-bit service routine addr 
tdml6int 
orm #0038h, @tspc ; configure sport and put in reset 
bd tdmselfl 
st #tdmadd1l6, @state ; 16-bit service routine addr 
tdm08int 
orm #003ch, @tspc ; configure sport and put in reset 
st #tdmadd8_1, @stat ; 8-bit service routine addr 
tdmself1l 
orm #0080h, @tspc ; take sport out of reset 
rsbx xf ; Signal ready-to-receive 
* Poll for receive data ready 
tdmspin 
rsbx te ; Clear flag 
tdmspinn 
bcd tdmspinn, ntc 7 begin receive data routine 
bitf @tspc, #0400h ; if rrdy = 1 
tdmrcv_isr 
ld @state, a 7 vector to the sport receive routine 
bacc a 
* Load destination address (16-bit serial mode) 
tdmadd16 
mvdk @trev, arl ; get destination addr into arl 
mvkd arl, @dest ; save destination addr 
bd tdmspin 
st #tdmleni6, @state ; next service routine 
* Load end address (16-bit serial mode) 
tdmlenl6 
1ldm trcev, a 7 get length in words 
add @dest, a ; add destination address 
stlm a, ar0 ; Save end address 
bd tdmspin 
st #tdmisrl6, @state ; next service routine 
*  Bootload 16-bit serial data 
tdmisr16 
Law @arl, a 7 get the destination address 
cmpr eq, arl ; check for RAM full condition 
bed tdmspin, ntc ; spin if transfer not complete 
writa @trcv 7 write object word at destination addr 
mar *ari+ ; increment destination addr 
b endboot 
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Warm Boot Option (Continued) 


Example 8-1. 
* Load destination address 
tdmadd8_1 
ld @trcev, 8, a ; 
and #0ff00h, a 7 
stl a, @hbyte ; 
bd tdmspin 
st #tdmadd8_2, @state ; 
tdmadd8_2 
ldu @trcv, a ‘ 
and #0ffh, a r 


or @hbyte, a ; 


stim a, arl . 
stl a, @dest bd tdmspin 
st #tdmlen8_1, @state ; 
* Load end address 
tdmlen8sg_1 
ld @trcev, 8, a , 
stl a, @hbyte ; 
bd tdmspin 
st #tdmlen8_2, @state ; 
tdmlen8sg_2 
ldu @trcv, a : 
and #0ffh, a c 
or @hbyte, ad : 
add @dest, a ; 


stlm a, ar0 4 

bd tdmspin 

st #tdmisr8_1, @state ; 
tdmisr8_1 

ld @trcev, 8, a 

stl a, @hbyte ; 

bd tdmspin 

st #tdmisr8_2, @state ; 
tdmisr8_2 

1dau @trcv, a ‘ 

and #0ffh, a - 

or @hbyte, a . 

stl a, @s8word ; 

1ldm arl, a ; 


cmpr eq, arl ; 
bed endboot, tc ‘ 
writa @s8word 7 
mar *arlt+ 

bd tdmspin 

st #tdmisr8_1, @state ; 
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(8-bit serial mode) 


acc A <-- junkbyte.high byte 


acc A <-- high.byte 
save high byte 


next service routine 


acc A <-- junkbyte.low byte 


acc A <-- low byte 


acc A <-- high byte.low byte 
save destination address 


next service routine 
(8-bit serial mode) 


acc A <-- high byte 
save high byte 


next service routine 


acc A <-- junkbyte.low byte 


acc A <-- low byte 


acc A <-- high byte.low byte 
add destination address 


save end address 


Warm Boot 


next service routine* Bootload 8-bit serial data 


;acc A <-- high byte 


save high byte 


next service routine 


acc A <-- junkbyte.low byte 


acc A <-- low byte 


acc A <-- high byte.low byte 


save 16-bit word 


get destination addr 
check for RAM full condition 


exit if RAM full 


copy word to pmem[<arl>] 


next service routine 


Bootloader 
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Warm Boot 


Example 8-1. 


Warm Boot Option (Continued) 


ke & K  F * & & KK eK &F K RK RK BK KK wk KK KK KR KB KR K RK * 


*  Bootload from parallel I/O port (pa0) 


i a a a i i a a, a a a a i a, a a a a a a 


*  Bootload from I/O port 


pasyncl6 
call 
portr 
call 
portr 
mvdk 
ldu 

loop16é 
call 
portr 
ssbx 
rpt 
nop 
writa 
add 
banz 
ldu 
bacc 


Bootload from I/O port 


handshake 


paO, @dest : 
handshake 

paoO, @lngth ; 
@lngth, arl i; 
@dest, a ; 


handshake 


paO, @temp ; 
xf ; 
#8 

r 
@temp ; 
#1, a ; 
loop16, *arl- ; 
@dest, a ; 
a 


(8-bit parallel), 


(16-bit parallel) 


read word from port to destination 


read word from port to length 


arl <-- code length 


acc A <-- destination address 


read word from port to temp 
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acknowledge word as soon as it’s read 


10 cycles delay between xf and write 


write word to destination 


increment destination address 


loop if arl is not zero 


branch to destination addess 


get destination address from lst two byte 


call handshake 

portr pa0O, @hbyte 

ld @hbyte, 8, a : 

stl a, @hbyte ; 

call handshake 

portr pad, @dest 

ldu @dest, a F 

and #0ffh, a ; 

or @hbyte, a ; 

stl a, @dest ; 
* get code length from 2nd two 

call handshake 

portr pad, @hbyte 

ld @hbyte, 8, a F 

stl a, @hbyte : 

call handshake 

portr pad, @lngth 

ldu @lngth, a ‘ 

and #0ffh, a ; 

or @hbyte, a : 

stl a, @lngth iH 

stlm a, arl ; 

ldu @dest, a 

ld a, b ; 
8-20 


read high byte from port 
save high byte 


read low byte from port 
clear upper byte 

combine high and low byte 
save destination address 
byte 


read high byte from port 
save high byte 


read low byte from port 
clear upper byt 

combine high and low byte 
save code length 

arl <-- code length 


acc B <-- destination address 


MS byte first pasync08 
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Example 8-1. Warm Boot Option (Continued) 
loop08 
call handshake 
portr pa0, @hbyte 
ld @hbyte, 8, a ; vead high byte from port 
stl a, @hbyte 7 save high byte 
call handshake 
portr pa0, @temp 
ssbx xf ; acknowledge byte as soon as it’s read 
ldu @temp, a ; read low byte from port 
and #O0ffh, a ; clear upper byte 
or @hbyte, a ; combine high and low byte 
stl a, @temp ; save code word 
ld b, a ; acc A <-—- destination address 
nop 
nop ; 10 cycles delay between xf and write 
writa @temp ; write code word to program memory 
add #1, a ; increment destination address 
id a, b ; Save new destination address 
banz loop08, *arl- ; loop if arl not zero 
ldu @dest, a ; branch to destination address 
bacc a 
* Handshake with BIO signal using XF 
handshake 
ssbx xf ; acknowledge previous data word 
biohigh 
be biohigh, bio , wait till host sends request 
rsbx xf ; indicate ready to receive new data 
biolow 
re bio ; wait till new data ready 
b biolow 
bootend 
-end 
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Bootloader 


Warm Boot 
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Host—Target Communication 


This chapter describes the communication interface between the ’54x EVM 
and its host. The system passes data between the target and host, while main- 
taining real-time operation. The system can be driven by interrupts, polled, or 
a mixture of the two. 


Topic Page 
9.1. Communication Channels .............0cc cece cece eee e eens 9-2] 
9.2 Handshake and Data Transfer .......... 000 eee e eee eee eee 9-6 
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9.1 Communication Channels 


The host communicates to the 54x EVM via 38 16-bit I/O locations. Each I/O 
location is defined by an offset to an I/O page 0 address, shown in Table 9-1. 
There are two independent communication channels, A and B, through which 
the host and target can communicate. The status/control registers on both the 
host and target provide system-level control and status information. 


Table 9-1. ‘54x EVM Host-Interface Register Usage 


/O Offset From 


Base Address Register Size Register Type 
0x0800 Channel A 16 Read/write 
0x0804 Channel B 16 Read/write 


0x0808 Status/control 16 Read/write 


The host writes to channel A (offset 0x0800) and overwrites the current value. 
An interrupt 1 (INT1) signal is generated to the target, which sets the channel 
A transmit status bit, AXST, to 1 in the host control register (HCR) and sets the 
channel A receive status bit, ARST, to 1 in the target control register (TCR). 
The host reads from the same location, which clears the ARST bit in HCR and 
clears AXST in TCR. Channel B is a 64-word deep, bidirectional FIFO register 
that transfers both data and commands. Host write to 0x0804 is buffered by 
the FIFO. Data is ignored if the FIFO is full. Figure 9-1 shows the HCR and 
Table 9-2 describes the bits. 


Figure 9-1. Host Control Register (HCR) Diagram 


13 12 11 10 9-8 7 6 5 4 3-2 1 0 


15 14 
RESET | mp/Mc | BTIE | ATIE | HBIO | BRST2 | BRST | REV1 | REVO | XE | B10 | Bxst | aRsT | AXsT 
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PRELIMINARY Communication Channels 


Table 9-2. Host Control Register (HCR) Bit Summary 


Bit Name Description 
15 RESET Software reset. If RESET = 1, the target processor and emulation logic are reset 


but the host/target communication flags are not reset. 


14 MP/MC Microprocessor/microcomputer mode select. The EVM powers up in microcomputer 
mode. 
13 BTIE Channel B target interrupt enable. If BTIE = 1, channel A receive conditions generate 


a host interrupt. 


12 ATIE Channel A target interrupt enable. If ATIE = 1, channel A receive conditions generate 
a host interrupt. 


11 HBIO Host BIO input to target processor 
10 BRST2 Channel B receive status bit 2. If BRST2 =1, the target has written to channel B, 
forcing an interrupt. The BRST2 flag is cleared when the host reads its channel B. 
9-8 BRST Channel B receive status: 
BRST 
 Bitg Bits Channel B Receive Status 
0 0 Buffer empty 
0 1 Buffer less than half full 
1 0 Buffer half or more than full 
1 1 Buffer full 
7 REV1 Card revision status bit 1 
6 REVO Card revision status bit 0 
5 XF External flag from target processor (status) 
4 BIO BIO input to target processor 
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Table 9-2. Host Control Register (HCR) Bit Summary (Continued) 


Bit Name Description 
3-2 BXST Channel B transmit status: 
BXST 
Bit 3 Bit 2 Channel B Transmit Status 

0 0 Buffer empty 

0 1 Buffer less than half full 

1 0 Buffer half or more than full 

1 1 Buffer full 
1 ARST ChannelA receive status. If ARST =1, the target has written to its channel A register. 


The ARST flag is cleared when the host reads channel A. 


0 AXST Channel A transmit status. If AXST = 1, host has written to its channel A register. 
The AXST flag is cleared when the target reads channel A. 


The EVM supports two communication channels, configured as six I/O ports 
for host/target communication and 16 I/O ports for user expansion. Channel 
A is a single 16-bit bidirectional register mapped into two I/O port locations. 
Channel B is a single, bidirectional, 64-deep, FIFO buffer that is mapped into 
two I/O port locations. A status I/O port provides target control and general- 
purpose control, status, and discrete-bit I/O. Figure 9-3 shows the TCR and 
Table 9-3 describes the bits. 


Figure 9-2. '54x EVM Port Usage 


“PortAddress Name Usage 
0x0010 Channel A Communications 
0x0012 Channel B Communications 
0x0014 Status Target status/control 


Figure 9-3. Target Control Register (TCR) Diagram 


15 14 13 12 11-8 f 6 5-4 32 1 0 
AICRST | USR-BOT2 | USR-BOT1 | USR-BOTO | Reserved | USR-BIN1 | USR-BINO | BRST | BXST | ARST | AXST 
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Table 9-3. Target Control Register (TCR) Bit Summary 


Bit Name Description 
15 AICRST If AICRST = 0, the analog interface circuit is reset 
14 USR-BOT2 — User discrete output bit 2 
13 USR-BOT1 _ User discrete output bit 1 
12 USR-BOTO — User discrete output bit 0 
11-8 = Reserved 
rs USR-BIN1 User discrete input bit 1 
6 USR-BINO User discrete input bit 0 
5-4 BRST Channel B receive status: 
BRST 
Bits Bit4— Channel B Receive Status 
0 0 Buffer empty 
0 1 Buffer less than half full 
1 0 Buffer half or more than full 
1 1 Buffer full 
3-2 BXST Channel B transmit status: 
BXST 
Bits Bit2_ Channel B Transmit Status 
0 0 Buffer empty 
0 1 Buffer less than half full 
1 0 Buffer half or more than full 
1 1 Buffer full 
1 ARST ChannelA receive status. If ARST =1, the host has written to its channel A register. 


The ARST flag is cleared when the target reads channel A. 


0 AXST Channel A transmit status. If AXST = 1, target has written to its channel A register. 
The AXST flag is cleared when the host reads channel A. 


Note: For further register and |/O information, see the TMS320C54x Evaluation Module Technical Reference. 
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9.2 Handshake and Data Transfer 


Example 9-1 through Example 9—4 show how to communicate between the 
host and the target through both channels, A and B. The communication in- 
volves two steps: a handshake and a data transfer. Channel A sends com- 
mands between the host and the target and channel B uses the FIFO buffer 
to transfer data, either 64 or 32 words at a time. A buffer of 256 samples is 
transferred from target to the host. Data is sent 32 words at a time from the 
target, except for the first FIFO, where the first 64 words are sent to the host. 
The data is transferred from the target to the FIFO whenever an INT1 signal 
occurs. This is generated when the host writes to channel A. The XF line on 
the host control sets up a handshake between the host and the target. 
Figure 9—4 illustrates the sequence of events in a handshake and Figure 9—5 
illustrates the sequence of events in a data transfer. 
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Figure 9-4. Handshake Protocol 


Read channel B until the FIFO is empty. 
Write command to channel A (AXST = 1 
generates INT1). 


ARST = 1, since host has written to 
channel A polls IFR 

Read channel B until the FIFO is 
empty. 

Read channel A for the command 
send by host (ARST = 0). 

Send a handshake command to host 
through channel A (AXST = 1). 

Set XF to low. 


AXST = 0, target has read the 
command; poll XF bit. 

ARST = 1, target has written a 
command to channel A. 

Read handshake command from 
channel A (ARST = 0). 


AXST = 0, since host has read 
the command. 
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Example 9-1. Handshake — Target Action 


KKKKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK KKK 
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* This file includes the TCR register configuration of EVM 
KKKKKKKKKKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKKKKKKKKKKKKKKEKKEK 


K_AIC_RST 
K_USR_BOT 


K_RESRV 
K_USR_BIN 
K_RCV_BRST 


K_XMT_AXST 
K_TCR_HIGH 
K_TCR_LOW 
K_TCR 


K_O 
K_FIFO_FULL 


K_FIFO_EMPT 


K_AXST_CLEAR 


K_HANDSHAKE 
K_CHB 


K_CHA 


K_TRGCR_ADD 


K_AXST 
K_ARST 
K_BXST 
K_FIFO_SIZE 
K_FRAME_SIZ 
K_HOST_FLAG 


-se 
-se 


-se 
-se 
-se 


-se 


.se 
.se 
.se 
.se 
.se 
KKK KKK KK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKKKKKAKKKK KKK KKK KKK 
* this includes I/O address of CH_A, CH_B and different commands that’s been 


* passed between host and the target 
KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKKKAKKKKKKK KKK KKK 


-se 
-se 


Y 


-se 


-se 


-se 


-se 


-se 


R 


sSét 
.set 
»set 
.set 
E .set 
Set 


-se 


cv st 


cr GY Cr 


cv st Cr tT Cr ok ee 


cv st 


Ob << 15 
000b << 12 


0000b << 8 
O0b << 6 
00b << 4 
llb << 2 


Ob << 1 
Ob << 1 


K_AIC_RST|K_USR_BOT|K 


a 


if AICRST=0, aic is reset 
User discrete output bits 
0,1,2 

Reserved bits 

User discrete input bits 0,1 
Channel B receive status regs 
buffer half or more K_XMT_BXST 
Ch B trasnmit status register 
buffer half or more K_RCV_ARST 
Cc 
Cc 


h A receive register 
h A transmit register 
RESRV 


K_USR_BIN|K_RCV_BRST|K_XMT_BXST|K_RCV_ARST | K_XMT_AXST 


K_TCR_HIGH|K_TCR_LOW 


Oh 
OxFF 


OxEI 


ic) 


OxAE 


OxAB 
12h 


10h 


14h 


; TEXAS INSTRUMENTS INCORPORATED 


-mmregs 
- include "target.inc” 

- include “init_54x.ine” 
- include “interrpt.inc” 
.ref FIFO_DP 

.ref d_command_reg 
.ref d_command_value 
.def evm_handshake 
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constant 0 

Full FIFO command written by 
target 

Empty FIFO command 

written by host 

Clear AXST empty command 
written by the target 
handshake CMD written by host 
Use Channel B as I/O interface 
to 54x EVM for sending data 
Use Channel A as I/O interface 
to 54x EVM for send command 

to host 

Target status control register 
I/O address location 

Oh 

used to check the control bits 
check if K_FIFO_SIZE 
its a 64 FIFO 

Frame size 

if 0, then host interface 
is disabled 
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Example 9-1. Handshake — Target Action (Continued) 


, Functional Description 


: This initiates the handshake between the host(PC) and the target (DSP). 

7 The host writes a command to CH A. This generates an INT1 on the target. 

; The AXST bit on HCR is set to 1.The INT1 bit in IFR is polled if it is set 
; then it is cleared to clear pending interrupts. The FIFO is cleared 

; by reading from the FIFO. The command from host is read thru CH A and ARST 
; on TCR is cleared. Another command from target is written to CH A, 

; which sets AXST. Also sets XF low. The host polls XF line. 

; The host reads CH A which clears ARST on host side and AXST on target side. 


-sect “handshke” 
evm_handshake: 
LD #0,DP 
BITF IFR,O2h ; Poll for INT1 
BC evm_handshake, NTC ; ARST = 1 
ST™ #K_INT1,IFR ; Clear the pending interrupt 
LD #FIFO_DP,DP 
RPT #K_FIFO_SIZE-1 
PORTR K_CHB,d_command_reg 7 assures that FIFO is empty to 
PORTR K_CHA,d_command_value ; ARST = 0 
target_handshake_command: ; vead the command from 
7 to acknowledge INT1 
PORTR K_TRGCR_ADDR, d_command_reg ; while (port14 & ARST) 
BITF d_command_reg, K_ARST ; check FIFO empty 
BC target_handshake_command, TC ; branch occurs 
LD #K_HANDSHAKE _CMD,A ; indicate of FIFO empty 
SUB d_command_value,A 
bad_handshake_command 
BC bad_handshake_command, ANEQ ; vead the command send by hosts 
ST #K_AXST_CLEAR, d_command_reg ; send to a command to clear AXST 
PORTW d_command_reg, K_CHA 7 write command to command reg A 
; AXST = 1 
RSBX XF , XF = 0 
RET 
.end 
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Example 9-2. Handshake — Host Action 


/* 
This function initializes the data buffer and reads the FIFO so that FIFO 
is empty when the real data transfers starts 


a a a ee */ 
void initialize_slave (void) 
{ 
ant: 37 
for (j=0;j3 < 64; 4 J++) 
dataa[j] = inport (BDAT_REG) ; /* read data from data reg. */ 
for (43=0; 3 <256; jt+t) 
dataa[j] = 0; 


outport (CONT_REG, inport (CONT_REG) & Oxf7ff); 


This initiates the handshake between the target and host. The host writes a command 
to target which sets the AXST flag to 1. The INT1 is generated whenever 

host writes to CH A. On the target side, INT1 is polled and reads the CH A. 

This clears ARST on target side. A command is written to Ch A on target after 
emptying the FIFO that sets AXSt =1. Later sets XF to go low. On host XF is polled 
and then reads CH A that clears ARST to 0 and AXST to O on the target side 

*/ 


int receive_clear_AXST (void) 


/* RECEIVE COMMAND FROM EVM ef 


{ 
command = OQxAB; 
outport (ADAT_REG, command) ; 


while (inport (CONT_REG) & AXST); /* write command to evm */ 
while ((inport (CONT_REG) & XF)); 
while(! (inport (CONT_REG) & ARST)); /* wait for evm to send command*/ 
reply = inport (ADAT_REG) ; /* vead command into reply */ 
while ((reply & OxAE) !=0xAE); 
return (reply) ; /* return command for process’ g*/ 

} 
[Ria a ee ee es */ 
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Figure 9-5. Data Transfer Protocol 


Handshake and Data Transfer 


Target action 


FIFO empty (BXST) 

{Read first 64 data from target into FIFO 

Send a command to host through channel A 

(AXST = 1) 

Initialize Counter to 1, since 64 data has already 
written to FIFO) 

} 


else 


discard the FIFO since there is a overrun on the input 
buffer 
} 


Read the FIFO full command through 
channel A (ARST = 0) 

Write the first 64 data from FIFO to data 
buffer 


AXST = 0, since host has read the 
command 


Send acommand to target after reading the 


FIFO through channel A (AXST = 1) 
Generates INT1 


INT1 occurs 


if (64< count < 256) 


{ 

Read the FIFO half- full command through 
channel A (ARST = 0) 

Send a command to target after reading the 
FIFO through channel A (AXST = 1) 

Generates INT 1} 


} 
ARST =0 
send data to screen 
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Target receives INT1 

Read the FIFO empty command from host through 
channel A (ARST = 0) 

if counter 7 

{ Read half FIFO (32 word) from target 
Send a command to host through channel A 
(ASXT = 1) 
increment counter 
return 

} 

else 

return 


Return form Interrupt 
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Example 9-3. Data Transfer — Target Action 


; TEXAS INSTRUMENTS INCORPORATED 


-mmregs 

- include "target.inc” 

.ref d_output_addr 
FIFO_DP -usect “fifo_var”,0 
d_command_reg -usect "fifo_var”,1 
d_command_value -usect "“fifo_var”,1 
d_fifo_count -usect “fifo_var”,1 
d_fifo_ptr -usect "fifo_var”,1 

.def fifo_host_transfer 

.def FIFO_DP 

.def d_command_reg 

.def d_command_value 

.def d_fifo_ptr 

.def d_fifo_count 


; Functional Description 
; This routine transfers a FIFO(64) of data to host thru CH B. 

7 In the process, after transferring data from DSP to FIFO sends a command 
; to host thru CH A.The host acknowledges and sends a command to target (DSP) 
i thru CH A. 


; The host transfer can be disabled by setting the K_HOST_FLAG =0 


-asg AR7, OUTBUF_P 
-asg AR7,SV_RSTRE_AR7 
-sect fifo Fil” 
fifo_host_transfer: 
LD #FIFO_DP,DP 
eit K_HOST_FLAG =1 
PORTR K_TRGCR_ADDR, d_command_reg ; while (port14 & BXST) 
BITF d_command_reg, K_BXST 
BC fifo_discard, TC ; FIFO discard 
MVDK d_output_addr, OUTBUF_P ; load PING/PONG bffr address 
RPT #K_FIFO_SIZE-1 ; write first set of 64 data 
; to FIFO 
PORTW *OUTBUF_P+, K_CHB ; Fill FIFO 
ST #K_FIFO_FULL, d_command_value 
PORTW d_command_value, K_CHA ; writecommand to comnd reg A 
ST #1,da_fifo_count ; start counting for transfers 
MVKD OUTBUF_P,d_fifo_ptr ; save the fifo_ptr 
fifo_discard 

.endif 

RET 

.end 
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Example 9-4. Data Transfer — Host Action 


command_FIFO receive_command_FIFO_FU 
for (fifo_size=0; fifo_size < 64; 
dataa[fifo_sizet+count ] 
send_command_new_FIFO (command) ; 

for (count=64; count< 256; count++) 
{ 
command_FIFO 
for (fifo_size=0; fifo_size < 32; 
dataa[fifo_sizet+count] 
count count+31; 
send_command_new_FIFO (command) ; 


fifo_ 


receive_command_FIFO_FULL()j; 


Handshake and Data Transfer 


LL (); 
sizet+t+) 
inport (BDAT_REG) ; 


/* command from target*/ 


fifo_sizett) 
inport (BDAT_REG) ; 


/* vead 32 word fifo*/ 


/* send command to target*/ } 


int receive_command_FIFO_FULL (void) 
/* RECEIVE COMMAND FROM EVM */ 
{ 
while(! (inport (CONT_REG) & ARST)); /* wait for evm to send command*/ 
reply = inport (ADAT_REG) ; /* read command into reply */ 
while ((reply & OxFF) !=0OxFF); 
return (reply); /* veturn command for process’ g*/ 
} 
[fe *./ 
/* This function sends a command to target for a new set of data from FIFO*/ 
void send_command_new_FIFO (command) 
unsigned int command; 
{ 
command = OxEE; 
outport (ADAT_REG, command) ; 
while (inport (CONT_REG) & AXST); 
} 
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Chapter 10 


Application Code Examples 


This chapter contains complete code examples for routines that are excerpted 
in previous chapters of this book. These routines have been developed using 
a 54x EVM platform. These programs demonstrate applications that use a 
host interface and run in real time. You may download them to use in develop- 
ing your own applications. 
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10.1 Running the Applications .......... 0. ccc eee eee 
We (Neyer ib iieenlS cecnaocsonscnudoousononuononsnanonuodenupane 
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10.1 Running the Applications 


10-2 


The host communicates to the 54x EVM through 16-bit I/O locations. Each I/O 
location is defined by an offset to an I/O page 0 address. The offset used for 
these applications is 0x240 + 0x800 for channel A, 0x240 + 0x804 for channel 
B, and 0x240 + 0x808 for the target/status control register, where 0x240 is the 
base address of the host. Check your PC system documentation to make sure 
that I/O space 0x240 does not conflict with other I/O devices. If the EVM is 
mapped to other than space 0x240, the base addresses of the control and 
status registers must be modified in the file host.h. 


The ’54x assembler assembles code and generates object files. The linker 
command file links the object files and generates a file named main.out, using 
common object file format (COFF). You must load main.out into the EVM 
debugger command window with the LOAD command and compile the host 
software using a Borland C compiler. This generates the file, master.exe. that 
contains graphic routines that display data transferred from the target to the 
host. 


To run the target application, load main.out into the EVM debugger. To start the 
program on the target side, press the F5 function key. If you halt the program 
with the escape key or use the halt command in the debugger window, the pro- 
gram remains in the handshake loop waiting for the host to send a command. 
Press the F5 function key to continue. To run the host application, execute 
master.exe at the DOS prompt or a window command line. When master.exe 
is executed by the host, it displays the message: 


Graphics: No error. Press any key to halt. 


When you press this key, the graphics window opens and displays data for the 
task the target has initiated. The default task is an oscilloscope routine. To 
change to a different task, go to the debugger window, halt the program, and 
in the command window, type: 


e *present_command = x 


where x = 1, 2, 3, 4, 5 or 6, and present_command has one of the following 
values: 

1 = oscilloscope 

2 = Low-pass finite impulse response (FIR) filter using MAC instruction 

3 = Low-pass infinite impulse response (IIR) filter using biquad sections 

4 = Lowpass FIR filtering using FIRS instruction 


5 = System identification using adaptive filtering with least mean squares 
(LMS) instruction 


6 = 256-point real fast Fourier transform (FFT) 
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You can view the output of the present task in the graphics window. 


To exit the host application, press F3. Communication is lost if at any time the 
target code is reloaded or reset in the command window while the host execut- 
able is running in the background. This means that if you attempt to reset or 
reload the code in the debugger window and you press F5, the computer locks 
up. This occurs because there is no handshake between the host and the tar- 
get. To unlock, reload and run the code (press F5) on the target side. On the 
host side, quit the window and rerun the executable. 


The adaptive filter can be tested in two steps. The initial step size d_mu = 0 
in the first step. If the present task is changed at the debugger window with the 
command e *present_command = 5, runs with d_mu = 0. Thus, the system is 
not identified since the coefficients of the adaptive filter are not updated. In the 
second step the step size can be changed by typing e *d_mu = 0x1000 at the 
command window. In this case, the system is identified and the filter coeffi- 
cients are adapted using the LMS algorithm. In both cases the error signal can 
be observed both on host and also from the output of the ’ACO1. 


Application Code Examples 10-3 


Application Code 


10.2 Application Code 


Table 10-1. 
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Table 10-1 lists programs appropriate for running on a target system and tells 


you where to look for them in this chapter. 


Target Files 


Title 


Vector Table Initialization 

Main Program That Calls Different Functions 
Memory Allocation for Entire Application 
Processor Initialization 

Initialization of Variables, Pointers, and Buffers 
Initialization of Serial Port 1 

"AC01 Initialization 

*ACO1 Register Configuration 

Receive Interrupt Service Routine 

Task Scheduling 

Echo the Input Signal 

Low-Pass FIR Filtering Using MAC Instruction 


Low-Pass Biquad IIR Filter 


Low-Pass Symmetric FIR Filtering Using FIRS Instruction 


Adaptive Filtering Using LMS Instruction 
256-Point Real FFT Initialization 

Bit Reversal Routine 

256-Point Real FFT Routine 


Unpack 256-Point Real FFT Output 


Compute the Power Spectrum of the Complex Output 


of the 256-Point Real FFT 


File Name 


vectors.asm 
main.asm 
memory.asm 
init_54x.asm 
prcs_int.asm 
init_ser.asm 
init_aic.asm 
aic_cfg.asm 
rcv_intt.asm 
task.asm 
echo.asm 
firasm 
iirasm 
sym_fir.asm 
adapt.asm 
rfft-asm 
bit_rev.asm 
fft-asm 
unpack.asm 


power.asm 


Page 
10-6 
10-16 
10-10 
10-22 
10-29 
10-33 
10-38 
10-42 
10-46 
10-51 
10-56 
10-59 
10-69 
10-64 
10-74 
10-84 
10-87 
10-91 
10-97 
10-103 


PRELIMINARY 


PRELIMINARY Application Code 


Table 10-2 lists programs appropriate for running on a host system and tells 
you where to look for them in this chapter. 


Table 10-2. Communication Interface Files 


Title File Name Page 

Handshake Between Host and Target hand_shk.asm 10-25 

Interrupt 1 Service Routine hst_intl.asm 10-111 
Data Transfer from FIFO fifo.asm 10-106 
Main Function Call on Host Side master.c 10-118 
Function Calls on Host Side host.c 10-116 
Display the Data on the Screen view2.c 10-123 
Graphic Drivers Routine graphic2.c 10-121 


Example 10-28 on page 10-124 shows the linker command that links all object 
files together to produce a single executable COFF object module. This file es- 
tablishes the memory configuration for the entire application, using the ’541’s 
memory map. Example 10-29 on page 10-127 shows the configuration of the 
memory map for a 541 device used by an EVM debugger. 
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Example 10-1. Vector Table Initialization 


4 TEXAS INSTRUMENTS INCORPORATED 
: DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: vectors.asm 

; Version: 10 

; Status: draft () 

: proposal (X) 

; accepted () dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

r 

; Application Specific Products 

7 Data Communication System Development 
H 12203 SW Freeway, MS 701 

7 Stafford, TX 77477; { 

7 IPR statements description (can be collected). 
i} 

, 

; (C) Copyright 1996. Texas Instruments. 

. All rights reserved. 

, 

i{ 

; Change history: 

r 

; VERSION DATE / AUTHORS COMMENT 
; 1.0 July-24-96 / P.Mallela original created 
, 

i} 

i{ 

i 1. ABSTRACT 

r 

; 1.1 Function Type 

; a. Core Routine 

; b. Subroutine 

, 

; 1.2 Functional Description 

; This file contains vector table of 541 

r 

r 

7 1.3 Specification/Design Reference (optional) 

r 

7 1.4 Module Test Document Reference 

; Not done 

, 

7 1.5 Compilation Information 

: Compiler: TMS320C54X ASSEMBLER 

A Version: 1.02 (PC) 

; Activation: asm500 -s vectors.asm 

r 

; 1.6 Notes and Special Considerations 

r{ 

Lf 

7 This code is written for 541 device. The code is tested on 
, C54x EVM 
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Example 10-1. Vector Table Initialization (Continued) 


~ 
aed 


; 2. VOCABULARY 
; 
7 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; Ss 
i} 
i{ 
H 3. EXTERNAL RESOURCES 
r 
A 3.1 Include Files 
-mmregs 
- include Minit 54% .ine" 
- include "main.inc” 
; 3.2 External Data 
.ref SYSTEM_STACK 
A 3.3 Import Functions 
.ref main_start 
.ref receive_intl 
.cef host_command_int1l 
i} 
i{ 
i. 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; _ 
; 4.2 Global Static Data 
; 7 
; 4.3 Dynamic Data 
; = 
} 4.4 Temporary Data 
; os 
; 4.5 Export Functions 
i} 
v 
; 5. SUBROUTINE CODE 
; HeaderBegin 
r 
. 5.1 reset 
, 
7 5.2 Functional Description 
: This function initializes the vector table of 541 device 
j ---------------------------------------------~------ ~~ ++ ---- ++ -- ~~~ +--+ +--+ 
, 
; 5.3 Activation 
H Activation example: 
r 
; Reentrancy: No 
A Recursive: No 
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Example 10-1. Vector Table Initialization (Continued) 


5.4 Inputs 
5.5 Outputs 
5.6 Global 


5.7 Special considerations for data structure 


5.8 Entry and Exit conditions 


|DP | OvM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC 


LN cl Pe ee YO | 


| 
| 1 | o | o o |nu [Nu |NuU |NU |NU [NU |NU |NU |NU|NU|NU|NU |NU|NU 
| =| | | i 
| 1 | 0 | | 


| 
Ly (eae ere hacia nr Ite peade te esse al 


o |nu [nu [NU |NU |NU |NU |NU |NU |NU|NU|NU|NU 


T|TRN 


| 0 
| 
| o | 1 | 0 


0 NU 


Note : UM —- Used & Modified, U - Used, NU - Not Used 


5.9 Execution 
Execution time: ?cycles 
Call rate: not applicable for this application 


ST ee i i i i en ie i eh a Tee Ty 


; HeaderEnd 


; 5.10 Code 
-sect “vectors” 
reset: BD main_start ;RESET vector 
STM #SYSTEM_STACK, SP 
nmi: RETE 
NOP 
NOP a 
NOP ; NMI 
; software interrupts 
sint17 -space 4*16 
sint18 -space 4*16 
sint19 -space 4*16 
sint20 -space 4*16 
Ssant2 -space 4%*16 
sint22 -space 4*16 
sint23 -space 4*16 
sint24 -space 4*16 
sint25 -space 4*16 
sint26 -space 4*16 
sint27 -space 4%*16 
sint28 -space 4*16 
sint29 -space 4*16 
sint30 -space 4*16 
into: RETE 
NOP 
NOP ; INTO 
NOP 
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Example 10-1. Vector Table Initialization (Continued) 


intl : host_command_int1l ; Host interrupt 
STO 
STl ; INT1 


D 
SH 
Ss 


int2: 


tints RETE 


NOP ; TIMER 


rintod: RETE ; Serial Port Receive 
NOP ; Interrupt 0 


xintod: RETE ; Serial Port Transmit 
NOP ; Interrupt 0 


rint.Ls BD receive_intl ; Serial Port Receive 
PSHM STO ;Interrupt 1 
PSHM STL 
xintl: RETE ; Serial Port Transmit 
NOP ; Interrupt 1 


int3: RETE 


NOP ; INT3 
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Example 10-2. Memory Allocation for Entire Application 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 


; Filename: memory.asm 

; Version: 1... 

; Status : draft - 

; proposa (X) 

; accepted (_) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

v 

; Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


i{ 

; IPR statements description (can be collected). 

ir} 

, 

7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i{ 
; Change history: 


; VERSION DATE / AUTHORS COMMENT 
; 1.0 July-24-96 / P.Mallela original created 
, 

i} 

i{ 

i 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

; b. Subroutine 

, 

: 1.2 Functional Description 

; This file contains main function 

, 

, 

7 1.3 Specification/Design Reference (optional) 
, 

: 1.4 Module Test Document Reference 

; Not done 

, 

7 1.5 Compilation Information 

: Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

7 Activation: asm500 -s memory.asm 

, 

; 1.6 Notes and Special Considerations 


; This code is written for 541 device. The code is tested on C54x EVM 
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Example 10-2. Memory Allocation for Entire Application (Continued) 


~ 
aed 


7 2. VOCABULARY 

, 

; 2.1 Definition of Special Words, Keywords (optional) 

; = 

; 2.2 Local Compiler Flags 

; = 

; 2.3 Local Constants 

; = 

i} 

i{ 

; 3. EXTERNAL RESOURCES 

, 

A 3.1 Include Files 
-mmregs 


-include "“defines.inc” 
-include "main.inc” 


; 3.2 External Data 

A 3.3 Import Functions 

i} 

i{ 

; 4. INTERNAL RESOURCES 

’ 

A 4.1 Local Static Data 

¥ 

- 4.2 Global Static Data 

; = 

; 4.3 Dynamic Data 

; a 

; 4.4 Temporary Data 

f 

: 4.5 Export Functions 

i} 

; 5. SUBROUTINE CODE 

; HeaderBegin 

, 

, 

}---------- oe ee ee ee 7 eee eases See ee = 
7 5.1 main_start 

, 

; 5.2 Functional Description 

7 Memory configuration of the application 
ptt aS a a a ee eee 
, 

; 5.3 Activation 

; Activation example: 

, 

; Reentrancy: No 
; Recursive : No 
, 

; 5.4 Inputs 
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Example 10-2. Memory Allocation for Entire Application (Continued) 


; 5.5 Outputs 

, 

; 5.6 Global 

, 

; 5.7 Special considerations for data structure 

, 

7 5.8 Entry and Exit conditions 

, 

f |DP | ovM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 
, 

ie We Nee = | | | | 
jin |u | 1 [Nu [Nu [NU |NU |NU |NU |NU |NU |NU |NU |NU |NU |NU|NU|NU|NU |NU|NU | 
, 

Pep tod oe eT Lo ed ee | 
;out |u | 1 | 1 [nu | 1) [nu [nu [nu |Nu |NU |NU |NU |NU |NU |NU|NU|NU|NU |NU|NU | 
, 

; Note : UM - Used & Modified, U - Used, NU - Not Used 

, 

H 5.9 Execution 

: Execution time: ?cycles 

; Call rate: not applicable for this application 

, 

, 

; HeaderEnd 

; 5.10 Code 

STACK -usect "“stack”,K_STACK SIZE 

SYSTEM_STACK set K_STACK_SIZE+STACK 

input_data -usect “inpt_buf”,K_FRAME_SIZE*2 ; input data array 
output_data -usect “outdata”,K_FRAME SIZE*2 ; output data array 


; this section of variables are used in receive_intl routine and related routines 
RCV_INT1_DP -usect "rcov_vars”,0 


d_rcv_in_ptr -usect "tov vars” , 1. ; save/restore input bffr ptr 
d_xmt_out_ptr .usect "rov_vars", 1 ; save/restore output bffr ptr 
d_frame_flag -usect "rev_vars”, 1 

d_index_count .usect "rcov_vars”,1 

; System Coefficients 

scoff -sect "coeffh” 


-include “impulse.h” 
; RAM location for the System coefficient 


hceoff -usect "pufferh”, H_FILT_SIZE 

weoff -usect "bufferw”, ADPT_FILT_SIZE ; 

; RAM location for the input data 

xh -usect "bufferx”, H_FILT_SIZE ; input data for system 

XW -usect "pufferp”, ADPT_FILT_SIZE ; input data for adaptive filter 


; RAM location for filter outputs, residual error 
7 and temporary location for the new input sample. 


ADAPT_DP -usect “adpt_var”,0 
d_primary -usect “adpt_var”,1 
d_output -usect “adpt_var”,1 
d_error -usect “adpt_var”,1 
d_mu -usect “adpt_var”,1 
d_mu_e -usect “adpt_var”,1 
d_new_x -usect “adpt_var”,1 
d_adapt_count .usect “adpt_var”,1 
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Example 10-2. Memory Allocation for Entire Application (Continued) 


COFF_FIR_START .sect 


.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


6Fh 
OF3h 
269h 
50Dh 
8A9h 
OC99h 
OFF8h 
11EBh 
11EBh 
OFF8h 
OC99h 
8A9h 
50Dh 
269h 
OF3h 
6Fh 


COFF_FIR_END 


Y 


fir_coff_table .usect 
d_data_buffer -usect 
variables used in FI 


, 


circular buffers for coefficients and data buffers 


FIR_DP -usect 
afi din -usect 
d_filout -usect 
; variables used in II 
IIR_DP -usect 
d_iir_d usect 
d_iir_y -usect 
sect 


iir_table_start 


* 


* second-order section 


* 


iir_table_ 


.word 
.word 
.word 
.word 
.word 


=26778 
29529 
19381 
-23184 
-19381 


second-order section 


.word 
.word 
.word 
.word 
.word 


-30497 
321341. 
11363 
-20735 
11363 
end 


iir_coff_table .usect 
symmetric FIR filter 


td 


PRELIMINARY 


“Cott fir” 


WELLE COF 
"fir: bfr 


R routine 


R 


# 


# 


"fir var 
"fir var 
"fir var 
routine 
"iir_var 
"iir_var 
"iir_var 
"iir_cof 


Ol 


02 


"GOrt 11" 


coeffs 


£”, 20 
ae 40 


si. 0 
are 
Sp 


re 0 
s”,3*2 
Ss, i 
£" 


;A2 
7A1/2 
7B2 
7 BL 
;BO 


7,16 


, 


the 16 tap FIR coefficients 


, 


filter coefficients 
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Example 10-2. Memory Allocation for Entire Application (Continued) 


FIR_COFF -sect "sym_fir” ; filter coefficients 
-word 6Fh 
-word OF3h 
-word 269h 
-word 50Dh 
-word 8A9h 
-word OC99h 


-word OFF8h 

-word 11EBh 
; Circular buffers used in symmetric filter routine 
d_datax_buffer -usect "cir_bfr”,20 
d_datay_buffer -usect "cir_bfr1”,20 

-include "ref_tsk.inc” 


task_list -sect "task_tbl” ; calls the tasks itself 
-word do_nothing 
-word echo_task ; Echo routine 
-word fir_task ; FIR routine 


-word iir_task 
-word sym_fir_task 
-word adapt_task 
-word rfft_task 


task_init_list -sect "task_int” ; has the initialization of tasks 
-word do_nothing 
-word no_echo_init_task ; there is no init in this case 


-word fir_init 
-word iir_init 
-word sym_fir_init 
-word adapt_init 
-word do_nothing 
; variables used in task handling routine 


TASK_VAR_DP -usect "“tsk_vars”,0 
present_command .usect "tsk_vars”,1 
last_command -usect "tsk_vars”,1 
d_task_addr -usect "tsk_vars”,1 
d_task_init_addr .usect "tsk_vars”,1 
d_buffer_count -usect "tsk_vars”,1 
d_output_addr -usect "tsk_vars”,1 
d_input_addr -usect “"tsk_vars”,1 
; Set start addresses of buffers 
fft_data -usect "fft_bffr”, 4*K_FFT_SIZE ; fft data processing buffer 
7 Copy twiddle tables 
-sect “sin_tbl” 
sine_table . COpy twiddlel ; sine table 
sine -usect "twid_sin”,K_FFT_SIZE 
-sect "cos_tbl” 
cos_table .copy twiddle2 7 cosine table 
cosine -usect "twid_cos”,K_FFT_SIZE 
; Define variables for indexing input data and twiddle tables 
FFT_DP -usect "fft_vars”,0 
d_grps_cnt -usect "fft_vars”,1 ; (# groups in current stage)-1 
d_twid_idx -usect "fft_vars”,1 
; index of twiddle tables 
d_data_idx -usect "fft_vars”,1 , index of input data table 
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Example 10-2. Memory Allocation for Entire Application (Continued) 


a 


variables used for host interface 


Application Code 


FIFO_DP -usect "“fifo_var”,0 
d_command_reg -usect "fifo_var”,1 
d_command_value .usect “"fifo_var”,1 
d_fifo_count -usect "“fifo_var”,1 
a_fifo_ptr -usect "fifo_var”,1 
.end 
; Filename : defines.inc 
; this include file defines all the variables, buffers and pointers used for the entire 
; application 
.def STACK, SYSTEM_STACK 
.def input_data, output_data 
.def scoff,hcoff,wcoff,xh,xw 
-def ADAPT_DP,d_primary, d_output,d_error 
.def d_mu, d_mue, d_new_x, d_adapt_count 
.def fir_coff_table, d_data_buffer 
.def FIR_DP,d_filin,d_filout 
.def COFF_FIR_START, COFF_FIR_END 
.def TIR_DP,d_iir_d,iir_y 
.def iir_coff_table 
.def COFF_FIR_START, COFF_FIR_END 
-def. d_datax_buffer,d_datay_buffer 
.def FIR_COFF 
.def TASK_VAR_DP, present_command, last_command 
.def d_task_addr,d_task_init_addr, d_buffer_count, d_output_addr 
.def RCV_INTL_DP 
.def d_rcv_in_ptr,d_xmt_out_ptr 
-def d_frame_flag, d_index_count 
.def fft_data,sine, cosine 
.def FFT_DP,d_grps_cnt, d_twid_idx, d_data_idx 
.def cos_table,sine_table 
.def FIFO_DP 
.def d_command_reg 
.def d_fifo_count 
.def d_fifo_ptr 
; Filename: ref_tsk.inc 
; this includes all the task scheduling table referenced labels 
.ref do_nothing,echo_task, fir_task,iir_task 
.ref sym_fir_task, fir_init,iir_init,sym_fir_init 
.ref no_echo_init_task, fir_init,iir_init, sym_fir_init 
.ref adapt_init 
.def task_init_list,task_list 
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Example 10-3. Main Program That Calls Different Functions 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: main.asm 

; Version: LO 

; Status : draft ( j 

; proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

v 

; Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

. Stafford, TX 77477 


i{ 

; IPR statements description (can be collected). 

ir} 

, 

7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i{ 
; Change history: 


PRELIMINARY 


; VERSION DATE / AUTHORS COMMENT 
; Ib July-24-96 / P.Mallela original created 
, 

i} 

i{ 

i 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

: b.Subroutine 

, 

: 1.2 Functional Description 

; This file contains main function 

, 

, 

i 1.3 Specification/Design Reference (optional) 
, 

7 1.4 Module Test Document Reference 

; Not done 

, 

7 1.5 Compilation Information 

H Compiler: TMS320C54X ASSEMBLER 

7 Version: 1.02 (PC) 

; Activation: asm500 -s main.asm 

, 

; 1.6 Notes and Special Considerations 


; This code is written for 541 device. The code is tested on C54x 
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Example 10-3. Main Program That Calls Different Functions (Continued) 


~ 
ed 


7 2. VOCABULARY 
, 
A 2.1 Definition of Special Words, Keywords (optional) 
; - 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; ~ 
i} 
i{ 
H 3. EXTERNAL RESOURCES 
r 
A 3.1 Include Files 
-mmregs 
.- include "nit _54x,.41ne" 
- include "main.inc” 
; 3.2 External Data 
.ref d_frame_flag 
.ref RCV_INT1_DP 
: 3.3 Import Functions 
.ref aic_init,serial_init,init_54,init_bffr_ptr_var 
.ref task_handler, evm_handshake, fifo_host_transfer 
i} 
i{ 
; 4. INTERNAL RESOURCES 
; 
; 4.1 Local Static Data 
; _ 
; 4.2 Global Static Data 
; _ 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; 7 
; 4.5 Export Functions 
.def main_start 
i} 
; 5. SUBROUTINE CODE 
; HeaderBegin 
; 
; 
j------ ae a ees ee ee ei et ee te 
7 5.1 main_start 
r 
; 5.2 Functional Description 
7 This is the main function that calls other functions. 
pee a ee ee er 
r 
i 5.3 Activation 
A Activation example: 
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Example 10-3. Main Program That Calls Different Functions (Continued) 


H BD main_start 
; PSHM STO 
; PSHM ST1l 
r 
; Reentrancy: No 
; Recursive : No 
5.4 Inputs 


5.5 Outputs 


5.6 Global 


5.7 Special considerations for data structure 


5.8 Entry and Exit conditions 


|DP |ovM| SxM|c16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK 


a ee | 
[Nu [NU | 


ee i i en i Pee Td 


|B |BK|BRC| T|TRN| 

f | | | of ff 
nu |NU|NU|NU|NU |NU|NU | 
Wo oe | 
|NU|NU|NU|NU |NU|NU | 


| | | | | 
in |u | 1 | 1 [Nu | 1 [nu [NU [NU [NU |NU [NU |NU [NU | 

, 

| | | | boo Ph i I 
;out|u | 1 | 1 [nu | 1) [nu [NU [NU [NU |NU [NU |NU |NU [NU |NU|NU|NU|NU |NU|NU 
, 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
¥ 
; 5.9 Execution 
: Execution time: ?cycles 
; Call rate: not applicable for this application 
, 
, 


; HeaderEnd 
; 5.10 Code 
-sect "“main_prg” 
KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKKKKKK KKK KKK 
* The code initializes the 541 device, handshake between Target (DSP) 
* and the host (PC). Zeros all buffers, variables and init. pointers 
* Initializes serial port, programs ACOQl registers for selecting sampling 
* 


rate, gains etc.. 
KEK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKKKKKKKKKK KKK 


main_start: 
CALL init_54 yj; initialize STO,ST1 PMST and 
; Other registers 
ele K_HOST_FLAG = 1 
CALL evm_handshake ; EVM host handshake 
endif 
CALL init_bffr_ptr_var ; init tables,vars,bffrs,ptr 
CALL serial_init ; initialize serial_port 1 
CALLD aic init 7 Configures ACO1 
LD #0,DP 
NOP 
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Example 10-3. Main Program That Calls Different Functions (Continued) 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KK 


* 
* 
* 


After enabling interrupts 


from th 


above, the rea 


1 processing starts here. 


After collecting 256 samples from ACO1 a flag(d_frame_flag is set). 
Handles the task initiated by the user and transfers the data to the 


; restore the DP loop: 

; if 256 samples are received 
; if not just loop back 

, handles task scheduling 

; EVM HOST interface 


constant 

index count 

FIR buffer size 

index count 

there are 2 bi-quad sections 
each bi-quad has 5 coeffs 
stack size 

PING/PONG buffer size 

set after 256 collected 
H(z) filter size 

W(z) filter size 

initial step constant 
Enable EVM_HOST interface 
default ACO1 init 


x Real FFT code 
points in the 
FFT size is 


toe Ts 


# of complex points (=N) 
# of stages (=logN/1log2) 
Twiddle table size 

Data index for Stage 1 

Data index for Stage 2 

Data index for Stage 3 
Butterfly counter for Stage 3 


* host. Keeps the sequence forever !!!!! 
KKK KK KKK KK KKK KK KKK KKK KEK KKK KKK KK KKK KEK KKK KKK KEK KKK KKK KEK KKK KK KKK KKKKKKKKKKKKE KK 
start_loop 

LD #RCV_INT1_DP,DP 

BITF d_frame_flag,1 

BC loop, NTC 

CALL task_handler 

CALL fifo_host_transfer 

B loop 

.end 
* Includes all the constants — main.inc 
K_O .set 0 H 
K_FIR_INDEX -set 1 7 
K FIR_BFFR .set 16 ; 
K_negl .set -lh ; 
K_BIQUAD .set 2 ; 
K TIR_SIZE .set 10 H 
K_STACK_SIZE .set 200 7 
K FRAME SIZE -set 256 ; 
K_ FRAME FLAG -set 1 . 
H FILT SIZE -set 128 , 
ADPT_FILT_SIZE -set 128 r 
K_mu -set Oh ; 
K_HOST_FLAG .set 1 ; 
K DEFAULT_ACO1 -set 1h ; 
* This include file sets the FFT size for the ’C54 
* Note that the Real FFT size (i.e. the number of 
* original real input sequence) is 2N; whereas the 
* the number of complex points formed by packing the real inputs, 
* which is N. For example, for a 256-pt Real FFT, K_FFT_SIZE 
* should be set to 128 and K_LOGN should be set 
K_FFT_SIZE .set 128 ; 
K_LOGN .set 7 ; 
K_ZERO_BK .set 0 
K_ TWID_TBL_SIZE -set 128 H 
K_DATA_IDX_1 .set 2 7 
K_DATA_IDX_2 .set 4 7 
K_ DATA_IDX_3 -set 8 7 
K_FLY_COUNT_3 .set 4 H 
K_TWID_IDX_3 -set 32 7 
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Twiddle index for Stage 3 
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Example 10-3. Main Program That Calls Different Functions (Continued) 


KKK KKK KKK KKK KKK KKK KKK KKK KK KK KKK KKK KKK KK KKK KKK KK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


* FILENAME: INIT54x.INC 

* This include file contains all the initial values of STO, ST1, PMST, SWWSR, BSCR 
registers 

* STO Register Organization 


+ + + F F 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 
| 


* 
KKKKKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KK KK KK KKK 


K_ARP -set 000b<<13 ; ARP can be addressed from 00b -111b 
; reset value 

K._ TC -set 1b<<12 ;, TC = 1 at reset 

K_Cc -set lb<<11 ;, C = 1 at reset 

K_OVA .set 1b<<10 ; OVA = 0 at reset, Set OVA 

K_OVB .set lb<< 9 i OVB = 0 at reset, Set OVB 

K_DP .set 000000000b<<0 DP is cleared to 0 at reset 

K_STO .set K_ARP | K_TC|K_C|K Syahitovele DP 


KKK K KK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK 


*ST1 Register Organization 
* 


+ + F F HF 


KKKK KK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK 


K_BRAF .set Ob << 15 ; BRAF = 0 at reset 

K_CPL .set Ob << 14 ; CPL = 0 at reset 

K_XF .set lb << 13 ; XF = 1 at reset 

K_HM .set Ob << 12 ; HM = 0 at reset 

K_INTM .set lb << 11 ; INTM 

K_ST1_RESR -set Ob << 10 ; reserved 

K_OVM .set lb << 9 ; OVM = 0 at reset 

K_SXM .set lb << 8 ; SXM = 1 at reset 

K_C16 .set Ob << O7 ;, C16 = 0 at reset 

K_FRCT .set lb << 06 ; FRCT = 0 at reset, Set FRCT 
K_CMPT .set Ob << 05 i CMPT = 0 at reset 

K_ASM .set 00000b << 00 ASM = 0 at reset 

K_ST1_HIGH  .set K_BRAF | K_CPL|K_XF|K mile INTM|K_ST1_RESR|K_OVM|K_SXM 
K_ST1_LOW .set K_C16|K_FRCT|K_CMPT|K_ASM 

K_ST1 .set K_ST1_HIGH|K_ST1_Low 
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Example 10-3. Main Program That Calls Different Functions (Continued) 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KEK KKK KKK KKK KKK KK KKK 


*PMST Register Organization 


ep a a a i Se ee ee ee 
* | 15 7/ 6 |5 |4 |3 | 2 | 1 0 | 
to [eneeon=-- |---==- |----] ---- |----|------ |----------- 
* | IPTR |MP/MC |OVLY|AVIS|DROM|CLKOFF| Reserved | 
KS see aa ee Se ae Se a ee aS ee SS Se 
K_IPTR set 111111111b << 07 ; 111111111b at reset 
K_MP_MC -set lb << 06 ; 1 at reset 
K_OVLY -set Ob << 05 ; OVLY = 0 at reset 
K_AVIS -set Ob << 04 ; AVIS = 0 at reset 
K_DROM -set Ob << 03 ; DROM = 0 at reset 
K_CLKOFF .set Ob << 02 ; CLKOFF = 0 at reset 
K-PMST_RESR -set OOb << 0 ; reserved 

; for 548 bit 0 = SMUL 

j saturation on multiply 

bit 1 = SST = saturation on store 

K_PMST .set K_IPTR|K_MP_MC|K_OVLY|K eurale DROM | K_CLKOFF |K_PMST_RESR 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK 


*SWWSR Register Organization 


a a i a a a a a a a as a ee 
* | 15 |14 12|11 9|8 6| 5 3 | 2 o | 
a a a a se Se a a a a a SS SS Se 
* | Reserved | 1/0 | Data | Data |Program| Program | 
x. SS t eae]. SS 4 eS a Se Se a Se Se a a a Se a a Se Se SS Se Se Ss SS 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK 


K_SWWSR_IO .set 2000h ; set the I/O space 
KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKKKKKKK 


*Bank Switching Control Register (BSCR) Organization 


+ + F F 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK 


K_BNKCMP -set O0000b << 12 ; bank size = 64K 
K_PS_DS .set Ob << 11 

K_BSCR_RESR_.set OO00000000b <<2 ; reserved space 
K_BH -set Ob. << 1 ; BH = 0 at reset 
K_EXIO -set Ob << 0 ; EXIO = 0 at reset 
K_BSCR .set K_BNKCMP | K_PS_DS|K_BSCR_RESR|K_BH|K_EXIO 
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Example 10-4. Processor Initialization 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: init_54x.asm 

; Version: 1.0 

3; Status +: draft () 

: proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
sf 

; AUTHOR Padma P. Mallela 

, 

; Application Specific Products 

; Data Communication System Development 
H 12203 SW Freeway, MS 701 

: Stafford, TX 77477 


i { 

, 

; IPR statements description (can be collected). 

i} 

, 

7 (C) Copyright 1996. Texas Instruments. All rights reserved 


i{ 
; Change history: 


PRELIMINARY 


; VERSION DATE / AUTHORS COMMENT 
- 1.0 July-29-96 / P.Mallela original created 
, 

i} 

i{ 

7 1. ABSTRACT 

, 

; 1.1 Function Type 

7 a.Core Routine 

; b. Subroutine 

, 

7 1.2 Functional Description 

: This file contains initialization of the processor 

r 

, 

7 1.3 Specification/Design Reference (optional) 

, 

; 1.4 Module Test Document Reference 

; Not done 

, 

7 1.5 Compilation Information 

7 Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

7 Activation: asm500 -s init_54x.asm 

, 

; 1.6 Notes and Special Considerations 

r{ 

, 

; This code is written for 541 device. The code is tested on C54x EVM 
, 
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Example 10-4. Processor Initialization (Continued) 


i} 
an 


, 


: 2. VOCABULARY 
, 
: 2.1 Definition of Special Words, Keywords (optional) 
; - 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; = 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
r 
7 3.1 Include Files 

-mmregs 

- include "init _ 54x%,ine” 
. 3.2 External Data 

3.3 Import Functions 


~ 
ed 


} 4. INTERNAL RESOURCES 
F 
; 4.1 Local Static Data 
¥ 
; 4.2 Global Static Data 
; Pa 
; 4.3 Dynamic Data 
; a 
; 4.4 Temporary Data 
’ 
: 4.5 Export Functions 
.def init_54 
7} 
; 5. SUBROUTINE CODE 
7 HeaderBegin 
, 
, 
— 22 = = ae 22 _ =z ee eee a _ 
; 5.1 init_54 
, 
7 5.2 Functional Description 
; Initializes the processor from a reset state 
; 22 abe 22tS= eee = ees — = = = 
, 
; 5.3 Activation 
; Activation example: 
; CALL init_54 
, 
; Reentrancy: No 
7 Recursive : No 
, 
. 5.4 Inputs 
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Example 10-4. Processor Initialization (Continued) 


5.5 Outputs 


5.6 Global 


5.7 Special considerations for data structure 


5.8 Entry and Exit conditions 


|DP | ovM| SxM st ec oa asl ase cae ke a ts B |BK|BRC| T 


| | | | | TRN | 

Nu |NU |NU |NU |NU |NU |NU |NU wo ia Peer ie a 
ae (eee ee) (a | | | 

|NU|NU|NU|NU_ | | 


, 

, 

, 

, 

, 

, 

, 

, 

, 

, 

, 

, 

;in hs NU ba N 
i | | 
jout | | 
, 

, 

, 

, 

i 

, 

, 

, 

, 

, 


U 
out cele nu |Nu |NU |NU |NU |NU |NU |NU |UM |UM |NU bis NuU|NU|NU|NU |NU|NU 
Note : UM - Used & Modified, U - Used, NU - Not Used 
5.9 Execution 
Execution time: ?cycles 
Call rate: not applicable for this application 
HeaderEnd 
5.10 Code 
-sect "main_prg” 
init_54: 
; Init.the s/w wait state reg.for 2 wait states for I/O operations 
STM #K_SWWSR_IO, SWWSR ; 2 wait states for I/O operations 
; wait states for Bank Switch 
STM #K_BSCR, BSCR ; O wait states for BANK SWITCH 
; initialize the status and control registers 
STM #K_STO, STO 
STM #K_ST1, ST1 
RETD 
STM #K_PMST, PMST 
end 
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Example 10-5. Handshake Between Host and Target 


; TEXAS INSTRUMENTS INCORPORATED 

; DSP Data Communication System Development / ASP 
, 

; Archives: PVCS 

; Filename: hand_shk.asm 

; Version: 1.0 

, Status draft 

; proposal (X) 

; accepted (_) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 


if 
; IPR st 


7 (CE) Co 


i{ 
; Change 


; Ly 


; dD 
, 
, 
; a ees 
, 
, 
; 1.4 
r 
, 
; ag 
r 
r 
r 
r 
; 1.6 


~ 
eee 


~ 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


atements description (can be collected). 
pyright 1996. Texas Instruments. All rights reserved. 
history: 
VERSION DATE i AUTHORS COMMENT 

0 July-26-96 / P.Mallela original created 
ABSTRACT 
Function Type 


a.Core Routine 
b.Subroutine 


Functional Description 

This file contains one subroutine: 

1) evm_handshake 

Specification/Design Reference (optional) 

called by main.asm depending upon if K_HOST_FLAG is set 


Module Test Document Reference 

Not done 

Compilation Information 

Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 

Activation: asm500 -s hand_shk.asm 
Notes and Special Considerations 
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Example 10-5. Handshake Between Host and Target (Continued) 


H 2. VOCABULARY 
, 
7 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; _ 
; 2.3 Local Constants 
; a 
i} 
i{ 
; 3. EXTERNAL RESOURCES 
r 
7 3.1 Include Files 
-mmregs 
- include "target.inc” 
-include "init _54x. ine” 
- include “interrpt.inc” 
; 3.2 External Data 
.ref FIFO_DP 
.ref d_command_reg 
.ref d_command_value 
; 3.3 Import Functions 
i} 
i{ 
H 4. INTERNAL RESOURCES 
r 
H 4.1 Local Static Data 
; = 
7 4.2 Global Static Data 
; = 
H 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; e 
; 4.5 Export Functions 
.def evm_handshake 
i} 
H 5. SUBROUTINE CODE 
; HeaderBegin 
, 
r 
; = 
7 5.1 evm_handshake 
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Example 10-5. Handshake Between Host and Target (Continued) 


; 5.2 Functional Description 

, 

; This initiates the handshake between the host(PC) and the target (DSP). 
; The host writes a command to CH A. This generates an INT1 on the target. 
: The AXST bit on HCR is set to 1.The bit in IFR is polled if it is set 

A then it is cleared to clear pending interrupts. The FIFO is cleared 

; by reading from the FIFO. The command from host is read thru CH A and 

; ARST on TCR is cleared. Another command from target is written to CH A, 
; which sets AXST. Also sets XF low. The host polls XF line. The host reads 
; CH A which clears ARST on host side and AXST on target side. 

, 

pS a a a a ee ee 

, 

; 5.3 Activation 

; Activation example: 

; CALL evm_handshake 

, 

; Reentrancy: No 

; Recursive : No 

, 

; 5.4 Inputs 

; NONE 

? 5.5 Outputs 

; NONE 

, 

. 5.6 Global 

; Data structure: d_command_reg 

‘ Data Format: 16-bit variable 

; Modified: Yes 

; Description: command from host is read thru CH A 

, 

; Data structure: d_command_value 

; Data Format: 16-bit variable 

; Modified: Yes 

7 Description: holds the command value 

; 5.7 Special considerations for data structure 

; - 

7 5.8 Entry and Exit conditions 

; 

: Tid (CSAs lls Geis cc) Gc |B |BK|BRC| T|TRN 
| | | | oe | 

jin ju | 1 [1 [nu | 1 [NU [NU [NU [NU [NU leg [nu |Nu |NU |NU|NU|NU|NU |NU|NU 
Poo dk ee he DT ee ot 
;out|u | 1 | 1 [nu | 1 [Nu [NU [NU [NU [NU |NU [NU |NU |NU |UM|NU|NU|NU |NU|NU | 
, 

; Note : UM - Used & Modified, U - Used, NU - Not Used 

, 

; 5.9 Execution 

; Execution time: ?cycles 

; Call rate: not applicable for this application 
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Example 10-5. Handshake Between Host and Target (Continued) 


; HeaderEnd 


7 5.10 Code 
-sect "handshke” 
evm_handshake: 
mp) #0,DP 
BITF IFR, 02h 
BC evm_handshake, NTC 
STM #K_INT1,IFR 
LD #FIFO_DP,DP 
RPT #K_FIFO_SIZE-1 
PORTR K_CHB, d_command_reg 
PORTR K_CHA, d_command_value 
target_handshake_command: 
PORTR K_TRGCR_ADDR, d_command_reg 
BITF d_command_reg, K_ARST 
BC target_handshake_command, TC 
LD #K_HANDSHAKE_CMD,A 
SUB d_command_value,A 


bad_handshake_command 
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BC bad_handshake_command, ANEQ 
ST #K_AXST_CLEAR, d_command_reg 
PORTW d_command_reg, K_CHA 

RSBX XF 

RET 

.end 


, 
, 
i 
Ma 
, 
, 
, 


[i 


Poll for INT1 
ARST = 1 
clear the pending interrupt 


assures that FIFO is empty to 
ARST = 0 

read the command from HOST 

o acknowledge INT1 

while (port14 & ARST) 

check FIFO empty 

branch occurs 

indicate of FIFO empty 


read the command send by hosts 
send to a command to clear AXST 
write command to command reg A 
AXST = 1 
XF = 0 
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Example 10-6. Initialization of Variables, Pointers, and Buffers 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: 
; Filename: 


7; Vers 
; Stat 


ion: 
us 


; AUTHOR 


if 


PVCS 

pres_int.asm 

1.0 

draft 

proposal (X) 

accepted (_) dd-mm-yy/?acceptor. 


Padma P. Mallela 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


; IPR statements description (can be collected). 


i} 
7 AC) 


if 


Application Code 


Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


} This 
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V. 


ERSION DATE / AUTHORS COMM 


ENT 


July-29-96 / P.Mallela original created 


ABSTRACT 


.1 Function Type 
a.Core Routine 
b. Subroutine 


-2 Functional Description 


This file contains initialization of buffers,pointers and variables 


.3 Specification/Design Reference (optional) 
-4 Module Test Document Reference 
Not done 
-5 Compilation Information 
Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s prcs_int.asm 


-6 Notes and Special Considerations 


code is written for 541 device. The code is tested on C54x EVM 
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Example 10-6. Initialization of Variables, Pointers, and Buffers (Continued) 


i} 
i{ 


, 


; 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
; _ 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; = 
i} 
i{ 
; 3. EXTERNAL RESOURCES 
; 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
; 3.2 External Data 
.ref input_data, output_data 
.ref present_command 
.reft d_rcv_in_ptr,d_xmt_out_ptr 
.ref RCV_INT1_DP 
.ref d_buffer_count 
; 3.3 Import Functions 
i} 
i{ 
7 4. INTERNAL RESOURCES 
, 
; 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; ee 
7 4.3 Dynamic Data 
; = 
F 4.4 Temporary Data 
; = 
; 4.5 Export Functions 
.deft init_bffr_ptr_var 
i} 
, 
H 5. SUBROUTINE CODE 
; HeaderBegin 
, 
; 
dice ie eS oe a ee ee ee re ee ee = 
; Sel anit btitr ptr 
, 
7 5.2 Functional Description 
; This routine initializes all the buffers, pointers and variables 
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Example 10-6. Initialization of Variables, Pointers, and Buffers (Continued) 


; 5.3 Activation 

: Activation example: 

; CALL init_bffr_ptr_var 

; Reentrancy: 

7 Recursive 

, 

: 5.4 Inputs 

; NONE 

; 5.5 Outputs 

; NONE 

: 5.6 Global 

, 

; Data structure: AR2 

7 Data Format: 16-bit input buffer pointer 

7 Modified: Yes 

; Description: initialize to the starting address 
ta 

; Data structure: AR3 

A Data Format: 16-bit output buffer pointer 

7 Modified: Yes 

7 Description: initialize to the starting address 
i 

; Data structure: present_command 

; Data Format: 16-bit variable 

7 Modified: Yes 

7 Description: holds the present command 

ti 

: Data structure: input_data 

A Data Format: 16-bit array 

7 Modified: Yes 

; Description: address of the input data buffer 
v 

; Data structure: output_data 

A Data Format: 16-bit array 

; Modified: Yes 

y Description: address of the output data buffer 
a 

; Data structure: d_rcv_in_ptr 

; Data Format: 16-bit var 

; Modified: Yes 

; Description: holds the staritng address of input bffr 
2 

; Data structure: d_xmt_out_ptr 

; Data Format: 16-bit variable 

; Modified: Yes 

: Description: holds the starting address of output bffr 
v 

; 5.7 Special considerations for data structure 
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Example 10-6. Initialization of Variables, Pointers, and Buffers (Continued) 


; 5.8 Entry and Exit conditions 
rd 
; Pe oem ony ones ease stair cha i coe ks (a |? ae sha 
, 
jin e fa | 1 hi | 1 ik i [NU [NU be | NU re ba e fa ak ie ae | 
, 
yout]/2 | 1 | 1 |Nu | 1 [Nu [NU ea fei | UM a |Nu |NU |NU |UM|NU]O |NU |NU|NU | 
, 
7 Note UM - Used & Modified, U - Used, NU - Not Used 
, 
, 
; 5.9 Execution 
: Execution time: ?cycles 
; Call rate: not applicable for this application 
roe 
; HeaderEnd 
; 5.10 Code 
-asg AR1, ZRPAD_P 7 zero pad pointer 
-asg AR2,GETFRM_IN_P ; get frame input data pointer 
-asg AR3,GETFRM_OUT_P , get frame output data pointer 
-asg AR2,COFF_P 
-sect "zeropad” 
init_bffr_ptr_var: 
STM #RCV_INT1_DP,AR2 ¢ Init all vars to 0 
RPTZ A,K_FRAME_SIZE/2-1 ; this may need mods if all vars 
STL A, *AR2+ ; are not in 1 page 
STM #input_data, GETFRM_IN_P ; input buffer ptr 
STM #output_data, GETFRM_OUT_P ; output buffer ptr 
LD #RCV_INT1_DP, DP 
MVKD GETFRM_IN_P,d_rcv_in_ptr holds present in. bffr ptr 
MVKD GETFRM_OUT_P,d_xmt_out_ptr holds present out bffr ptr 
ST #3,present_command initialize present command 
ST #K_0, d_buffercount reset the buffer count 
STM #input_data, ZRPAD_P 
RPTZ A, 2*K_FRAME_SIZE-1 zeropad both bottom 256 in- 
put_data 
STL A, *ZRPAD_P+ and fft_data buffers 
STM #output_data, ZRPAD_P 
RPTZ A, 2*K_FRAME_SIZE-1 
STL A, *ZRPAD_P+ 
RET 
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7. VE 


EXAS INSTRUMENTS INCORPORATED 


; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: init_ser.asm 

; Version: 1,0 

3}; Status + draft () 

: proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
’ 

; AUTHOR Padma P. Mallela/Ramesh A Iyer 


if 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


; IPR statements description (can be collected). 


i} 


; (C) Copyright 1996. Texas Instruments. All rights reserved. 


if 


; Change history: 


VERSION DATE / AUTHORS COMMENT 
1.0 July-29-96 / P.Mallela original created 


1. ABSTRACT 
1.1 Function Type 
a.Core Routine 


b.Subroutine 


1.2 Functional Description 
This file contains the initialization of the serial port 1 


1.3 Specification/Design Reference (optional) 


1.4 Module Test Document Reference 
Not done 


1.5 Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s init_ser.asm 


1.6 Notes and Special Considerations 


This code is written for 541 device. The code is tested on C54x EVM ; 
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PRELIMINARY 


i 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
, 
: 2.2 Local Compiler Flags 
, 
; 2.3 Local Constants 
; = 
i} 
i{ 
i 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include “interrpt.inc” 
- include “init_ser.inc” 
; 3.2 External Data 
; NONE 
; 3.3 Import Functions 
; NONE 
i} 
i{ 
; 4. INTERNAL RESOURCES 
, 
; 4.1 Local Static Data 
AIC_VAR_DP -usect “aic_vars”,0 
aic_in_rst -usect "aic_vars”,1 
aic_out_of_rst .usect "aic_vars”,1 


-2 Global Static Data 


-3 Dynamic Data 


-4 Temporary Data 
-5 Export Functions 
.def serial_init 


SUBROUTINE CODE 
HeaderBegin 


-l serial_init 


-2 Functional Description 
This routine initializes the serial port 1 of 541. The serial port is put 
in reset by writting 0’s to RRST and XRST bits and pulled out of reset by 
writting 1’s to both RRST and XRST bits. This routine also puts the ACO1 
in reset and after 12 cycles the ACO1 is pulled out of reset. The serial 
port initialization is done during the 12 cylce latency of the ACO1 init. 
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Example 10-7. Initialization of Serial Port 1 (Continued) 


5.3 Activation 
Activation example: 


Ne Ne Ne Ne Ne Ne 


CALL serial_init 

Reentrancy: No 

Recursive : No 
5.4 Inputs 


5.5 Outputs 
5.6 Global 


Data structure: aic_in_rst 


Data Format: 16-bit variable 
Modified: Yes 
Description: holds the value to put ACO1 in reset state 


Data structure: aic_out_of_reset 


ee i i Tn Ti Tee od 


Data Format: 16-bit variable 
; Modified: Yes 
; Description: holds the value to put ACO1 out of reset state 
; 5.7 Special considerations for data structure 
; - 
; 5.8 Entry and Exit conditions 
f 
; |pp |ovm|sxm|c16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
r 
| | | | | | | | | 
;in |o | 1 | 1 [nu | 1 [nu [NU [NU [NU |NU [NU |NU [NU [NU |NU|NU|NU|NU |NU|NU | 
, 
ay ee Cee ee | | Lo) Ved 
;out|u | 1 | 1 [wu | 1 [Nu [NU |NU [NU |NU [NU |NU [NU |NU |NU|NU|NU|NU |NU|NU | 
€ 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
F 3 
; 5.9 Execution 
; Execution time: ?cycles 
; Call rate: not applicable for this application 
. 2 
v 
; HeaderEnd 
: 5.10 Code 
-sect "ser_cnfg” 
serial_init: 
LD #AIC_VAR_DP, DP ; initialize DP for aic_reset 
oT #K_0O, aic_in_rst ; bit 15 = 0 of TCR resets AIC 
PORTW aic_in_rst,K_TRGCR_ADDR j;write to address 14h (TCR) 


KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


*We need at least 12 cycles to pull the AIC out of reset. 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 
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Example 10-7. Initialization of Serial Port 1 (Continued) 


STM #K_SERIAL RST, SPC1 ;reset the serial port with 
70000 0000 0000 1000 

STM #K_SERIAL_OUT_RST, SPC1 ;bring ser.port out of reset 
70000 0000 1100 1000 

RSBX INTM 

LD #0,DP 

ORM # (K_RINT1|K_INT1),IMR ; Enable RINT1,INT1 
7 0000 0000 0100 0010 

LD #AIC_VAR_DP, DP j; restore DP 

STM # (K_RINT1),IFR ; clear RINT1 

STM #K_0,DXR1 0000 0000 0100 0000 


; Pull the ACO1 out of reset - the ACO1l requires that it be held in reset for 


; 1 MCLK, which is equivalent to 96.45ns (based on an MCLK of 10.368MHz) 

chk #K_8000, aic_out_of_rst ; bit 15 = 1 brings AIC from 
reset 

RETD 

PORTW aic_out_of_rst, K_TRGCR_ADDR ; AIC out of reset 

.end 


KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK 


* FILENAME: “INIT_SER.INC™ 


Oh a a a a a ss a is Se Se ee a ee eee 
* 15 14 | 13 | 12 | 11 | 10 | 9 | 8 | 
#  [-----|-=--- |-------- |----=---- |------ |--=--- |----|-----| 
* FREE | SOFT| RSRFULL| XSREMPTY| XRDY | RRDY | IN1| INO | 
i SSeS SSS] aS SS a a SS SS a Se ee a ee Se a ee SS 
Sp as a a a a a as a 

* 7 6 | 5/4 | 3 | 2 ]1 | o | 

#  [aaa--|----- |--=-- |---| --=--]--=-|-----|-----| 

* RRST | XRST| TxXM | McM| FSM | FO | DLB | RES | 

* 

*x 


This include file includes the SPCl register configuration 
KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KK KKK KK KKK KK KKK KKKKKKKK KKK KKKKK 


;BLE Name Function 

20 Reserved Always read as 0 

71 DLB Digital loop back: 0 -> Disabled, 1_.Enabled 

pe FO Format bit: 0 -> data transfered as 8 bit bytes, 1 -> 16 bit 
words 

73 FSM Frame sync pulse: 0 -> serial port in continuous mode, 1 -> FSM 
is required 

74 MCM Clock mode bit: 0 -> CLKX obtained from CLKX pin 1 -> CLKX 
obtained from CLKX 

Pes) TXM Transmit mode bit: 0 -> Frame sync pulses generated externally 


and supplied on FSX pin, 1 -> Internally generated frame sync 
pulses out on FSX pin 


76 XRST Transmit reset bit: 0 -> reset the serial port, 1 -> bring 
serial port out of reset 

ey RRST Receive reset bit: 0 -> reset the serial port, 1 -> bring 
serial port out of reset 

78 INO Read-only bit reflecting the state of the CLKR pin 

79 IN1 Read-only bit reflecting the state of the CLKX pin 

710 RRDY Transition from 0 to 1 indicates data is ready to be read 

711 XRDY Transition from 0 to 1 indicates data is ready to be sent 
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72 XSREMPTY Transmit shift register empty ( Read-only) 0 -> tramsitter 
has experienced underflow 

713 RSRFUL Receive shift register full flag (Read-only): 0 -> Receiver 
has experienced overrun 

714 SOFT Soft bit - 0 -> immdeiate stop, 1-> stop after word completion 

e15 FREE Free run bit: 0 -> behaviour depends on SOFT bit, 1-> free run 


regardless of SOFT bit 
; The system has the following configuration: 


: Uses 16-bit data => FO = 0 

; Operates in burst mode => FSM = 1 

7 CLKX is derived from CLKX pin => MCM = 0 

; Frame sync pulses are generated externally by the AIC => TXM = 0 


; Therefore, to reset the serial port, the SPC field would have 
; 0000 0000 0000 1000 

; To pull the serial port out of reset, the SPC field would have 
; 0000 0000 1100 1000 


K_O .set OO0000000b << 8 ; bits 15-8 to 0 at reset 
K_RRST .set Ob << 7 ; First write to SPCl is 0 
; second write is 1 
K_XRST -set Ob << 6 ; First write to SPCl is 0 
; second write is 1 
K_TXM -set Ob << 5 
K_MCM .set Ob << 4 
K_FSM -set lb << 8 ; Frame Sync mode 
K_ZERO .set 000b << 0 
K_SERIAL_RST .set K_0|K_RRST|K_XRST|K_TXM|K_MCM|K_FSM|K_ZERO 
, first write to SPC1l register 
K_RRST1 -set lb << 7 ; second write to SPCl 
K_XRST1 -set Ib << 6 ; second write to SPC1l 
K_SERIAL_OUT_RST .set K_0|K_RRST1|K_XRST1|K_TXM|K_MCM|K_FSM|K_ZERO 
K_TRGCR_ADDR .set 14h ; Target/Status I/O address 
K_O .set Oh 
K_8000 -set 8000h , set bit 15 to pull AIC out 
; of reset 
* FILENAME: INTERRUPT.INC 
Co = Sa SS Se Se SS a a a es Se eS a et ee 
* | 15 9| 8 | 7 | 6 | 5 [| 4 | 3 |2 J2 |] 0 | 
ee Gene |-----|------ |------ |------|------ |-----|-----| -----|-----| 
* Reserved | INT3| XINT1| RINT1| XINTO| RINTO| TINT| INT2| INT1| INTO| 
a 
* 


This file includes the IMR and IFR configuration 
KKK KK KKK KK KKK KKK KEK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKKKKKKKKKKAKEK 


K IMR _RESR -set O0000000b << 9 ; xveserved space 

K_INT3 .set lb << 8 ; Gisable INT3 

K_XINT1 -set lb << 7 ; Gisable transmit interrupt 1 
K RINT1 -set Lo << 6 ; enable receive interrupt 1 
K_ XINTO -set lo << 5 ; Gisable transmit interrupt 0 
K_RINTO .set lb << 4 ; Gisable receive interrupt 
K_TINT .set ib << 3 ; Gisbale timer interrupt 

K_ INT2 .set lb << 2 ; disable INT2 

K_ INT1 .set lb << 1 ; Gdisbale INT1 

K_INTO .set lb << 1 ; enable INTO 
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; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: init_aic.asm 

; Version: 1.0 

3; Status +: draft () 

7 proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
rf 

; AUTHOR Padma P. Mallela/Ramesh A. Iyer 

, 

. Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


i{ 

; IPR statements description (can be collected). 

i} 

7(C) Copyright 1996. Texas Instruments. All rights reserved. 


r{ 
+ 
; Change history: 


PRELIMINARY 


7 VERSION DATE / AUTHORS COMMENT 
7 1.0 July-29-96/ P.Mallela original created 
, 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

: b. Subroutine 

, 

7 1.2 Functional Description 

; This file contains the initialization of ACO1 

, 

; 

: 1.3 Specification/Design Reference (optional) 

, 

A 1.4 Module Test Document Reference 

: Not done 

, 

; 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

H Version: 1.02 (PC) 

; Activation: asm500 -s init_aic.asm 

, 

: 1.6 Notes and Special Considerations 

i{ 

; This code is written for 541 device. The code is tested on C54x 
i} 
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Example 10-8. ‘AC01 Initialization (Continued) 
i{ 


H 2. VOCABULARY 
; 
7 2.1 Definition of Special Words, Keywords (optional) 
; is 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; ~ 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include “interrpt.inc” 
; 3.2 External Data 
; NONE 
H 3.3 Import Functions 
.ref wrt_cnfg ; initializes ACO1 
i} 
i{ 
H 4. INTERNAL RESOURCES 
, 
i 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; = 
7 4.3 Dynamic Data 
; ~ 
; 4.4 Temporary Data 
; = 
; 4.5 Export Functions 
.def aic_init 
i} 
; 5. SUBROUTINE CODE 
; HeaderBegin 
, 
, 
, 
y------ es a a eens a eee a ees eae 
; 5.1 aie _Anit 
, 
; 5.2 Functional Description 
Z This routine disables IMR and clears any pending interrupts before 
; initializing ACO1. The wrt_cnfg function configures the ACO1 
y------ es ean eres ae ee er eae — — 
; 
; 5.3 Activation 
; Activation example: 
; CALL aic_init 
, 
; Reentrancy: No 
; Recursive : No 
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PRELIMINARY 


7 5.4 Inputs 
; 
i 5.5 Outputs 
; 
; 5.6 Global 
; 
; 
; 5.7 Special considerations for data structure 
; _ 
; 5.8 Entry and Exit conditions 
, 
; |DP | ovM| SxM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
, 
| | | ; | | | of | | 
sin | Jo [| 21 | 1 [nu | 1 [Nu [Nu [NU |NU [NU |NU |NU [NU |NU|NU|NU|NU |NU|NU | 
, 
| | | | ; | | | of | | 
sout| |o | 1 | 1 [nu | 1 [Nu [NU [NU |NU [NU |NU [NU |NU |NU|NU|NU|NU |NU|NU | 
, 
; Note UM - Used & Modified, U - Used, NU - Not Used 
, 
; 5.9 Execution 
; Execution time: ?cycles 
7 Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
7 5.10 Code 
-sect "are entg” 
aic_init: 
CALLD wrt_cnfg yj initialize ACO1 
ANDM # (~K_RINT1|K_INT1), IMR ; Gisable receive_intl, INT1 
ORM # (K_RINT1|K_INT1),IMR ; enable the RINT1, INT1 
RETD 
STM (K_RINT1),IFR ; service any pending interrupt 
.end 


KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KKK KK KKK KK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK 


* This file includes the ACO1 registers initialization 

* All registers have 2 control bits that initiates serial communication 

* There are 2 communication modes - primary and secondary communications 

* During primary communication the control bits DOO and DO1 are 11 to request 
* for a secondary communication. In the secondary serial communications the 
* control bits D15 and D14 perform same control function as primary. 

* The R/W~ bit at reset is set to 0 placing the device in write mode. 

KEK KK KKK KKK KK KKK KK KKK KKK KK KKK KKK KK KKK KKK KK KKK KKK KK KKK KKK KKK KK KKKKKKKKEK 
K_NOP_ADDR -set 0 << 8 

K_REG_O -set K_NOP_ADDR 

K_A_ADDR .set 1 << 8 ; REG 1 address 

K_A_REG set 36 ; 

K_REG_1 -set K_A_ADDR|K_A_REG ; FCLK = 144KHz => A =24h 
K_B_ADDR .set 2 << 8 ; REG 2 address 

K_B_REG .set 18 ; 

K_REG_2 .set K_B_ADDR|K_B_REG ; Sampling rate = 8KHz 

K_AA ADDR .set 3 << 8 ; Register 3 address 

K_AA_REG .set 0 ; 
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K_REG_3 ~set K_AA _ ADDR|K_ AA_REG; 7 no shift 

K_GAIN_ADDR -set 4 << 8 ; Register 4 address 

K_MONITOR_GAIN set O00b << 4 ; Monitor output gain = squelch 

K_ANLG_IN_GAIN set Olb << 2 ; Analog input gain = OdB 

K_ANLG_OUT_GAIN .set Olb << 0 Analog output gain = OdB 

K_REG_4 -set K_GAIN ADDR | K MONITOR Gaul ANLG_IN GAIN |K ANLG_OUT_GAIN 

K_ANLG_CNF_ADDR -set 5 << 8 ; Register 5 address 

K_ANLG_RESRV .set 0 << 3 ; Must be set to OK_HGH_FILTER .set 0 << 2 
; High pass filter is enabled 

K_ENBL_IN .set Olb << 0 ; Enables IN+ and IN- 

K_REG_5 -set K_ANLG_CNF_ADDR | K_ANLG RESRV | K HGH. FILTER|K_ ENBL_IN 

K_DGTL_CNF_ADDR -set 6 << 8 ; Register 6 address 

K_ADC_DAC -set 0..<<-5 ; ADC and DAC is inactive 

K_FSD_OUT -set Q << 4 ; Enabled FSD output 

K_16_BIT_COMM -set 0 << 3 ; Normal 16-bit mode 

K_SECND_COMM -set 0 << 2 ; Normal secondary communication 

K_SOFT_RESET -set 0 << 1 ; Inactive reset 

K_POWER_DWN .set 0 << 0 Power down external 

K_REG_HIGH_6 -set K_DGTL_CNF. ADDR |K ADC anere FSD. OUT |K 16_BIT_COMM 

K_REG_LOW_6 -set K_SECND COMM | K SOFT. RESET | K POWER_DWN 

K_REG_6 -set K_REG_HIGH 6|K REG_LOW_6 

K_FRME_SYN_ADDR -set 7 << 8 ; Register 7 address 

K_FRME_SYN «Set 0 << 8 7 

K_REG_7 -set K_FRME_SYN. ADDR |K FRME_ SYN 

K_FRME_NUM_ADDR .set & << 6 i Register 8 address 

K_FRME_NUM -set 0 << 8 

K_REG_8 .set K_FRME_NUM ADDR|K FRME NUM 


; primary word with DO1 and DOO bits set to 11 will cause a 
; secondary communications interval to start when the frame 
7 sync goes low next 


K_SCND_CONTRL .set lib << 0 ; Secondary comm.bits 
AIC_REG_START_LIST .sect "“aic_reg” ; includes the aic table 
.word AIC_REG_END_LIST-AIC_REG_START_LIST-1 
-word K_REG_1 
-word K_REG_2 
-word K_REG_3 
-word K_REG_4 
-word K_REG_5 
-word K_REG_6 
-word K_REG_7 
-word K_REG_8 
AIC_REG_END_LIST 
K_XRDY .set 0800h ; XRDY bit in SPC1 
-sect "“aic_cnfg” 
aic_init: 
CALLD wrt_cnfg ; initialize ACO1 
ANDM # (~K_RINT1),IMR ; Gisable receive_intl 
ORM # (K_RINT1|K_INT1),IMR ; enable the RINT1, INTL 
RETD 
STM #(K_RINT1),IFR , service any pending interrupt 
.end 
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Example 10-9. ‘ACO1 Register Configuration 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: aic_cfg.asm 

; Version: 1.0 

; Status : draft ( j 

; proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
Lf 

; AUTHOR Padma P. Mallela/Ramesh A. Iyer 

v 

; Application Specific Products 

7 Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


; IPR statements description (can be collected). 


7(C) Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


a 


; VERSION DATE / AUTHORS COMMENT 
yo sO July-25-96 / P.Mallela original created 
, 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

; b. Subroutine 

, 

; 1.2 Functional Description 

; This file contains the ACO1 initialization 
, 

, 

: 1.3 Specification/Design Reference (optional) 

, 

; 1.4 Module Test Document Reference 

7 Not done 

, 

; 1.5 Compilation Information 

5 Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

; Activation: asm500 -s aic_cfg.asm 

, 

: 1.6 Notes and Special Considerations 


10-42 PRELIMINARY 


PRELIMINARY Application Code 


Example 10-9. ‘AC01 Register Configuration (Continued) 


; 2. VOCABULARY 
; 
A 2.1 Definition of Special Words, Keywords (optional) 
; _ 
} 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; - 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
, 
: 3.1 Include Files 

-mmregs 

- include "ale _cfig..ine" 
; 3.2 External Data 
: 3.3 Import Functions 
i} 
i{ 
7 4. INTERNAL RESOURCES 
, 
? 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; _ 
: 4.3 Dynamic Data 
; _ 
; 4.4 Temporary Data 
; _ 
: 4.5 Export Functions 

-def wrt_cnfg 

7} 
; 5. SUBROUTINE CODE 
: HeaderBegin 
, 
, 
pleted ee Se a SS a a tees 
; 5.1 wrt_cnfg 
, 
7 5.2 Functional Description 
: Writes new configuration data into the ACO1. Assuming a system 
; which processes speech signals and * requires the following parameters 
7 Low pass filter cut-off frequency = 3.6 kHz 
7 Sampling rate = 8000 Hz 
; Assume the Master clock MCLK = 10.368 MHz 
; This example demonstrates how to program these parameters - 
A the registers affected ar 
; Register A which determines the division of the MCLK frequency 
H to generate the internal filter clock FCLK. 
7 It also determines the -3 dB corner frequency of the low-pass filter 
7 Register B which determines the division of FCLK to generate 
; the sampling (conversion) frequency 
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7 5.4 
; 5.5 
; 5.6 
; 5.7 


10-9. 


‘AC01 Register Configuration (Continued) 


PRELIMINARY 


It also determines the -3dB corner frequency of the high-pass filter 


Activation 
Activation example: 


Da 
Da 


CALL 
STM 


Reentrancy: 
Recursive 


Inputs 
NONE 


Outputs 
NONE 


Global 


ta sStructu 


ta Format: 


Modified: 
Description: 


D 


wrt_cnfg 


#K_RINT1,IFR 


No 
No 


re: AR1 
16-bit 
No 


pointer 


indexes the table 


Special considerations for data structure 


: 5.8 Entry and Exit conditions 
, 
; Poe eae cel et ler AR1|AR2|AR3|AR4|AR5|AR6 pe |B |BK|BRc| T|TRN| 
j | | | | | | P| | | of ff 
v 
jin \ | 1 | 1 a | ia lag re i eo ae ee ee ho lids ts tisha | NU | NU 
; | 
, 
;out|u | 1 | 1 {NU [Nu |wu [uM |Nu |NU [uM |NU |NU |NU |UM|NU|NU|UM |NU|NU | 
, 
; Note UM —- Used & Modified, U - Used, NU - Not Used 
r 
; 5.9 Execution 
; Execution time: ?cycles 
7 Call rate: not applicable for this application 
; HeaderEnd 
; 5.10 Code 
r 
, 

asg AR1,AIC_REG_P 

-sect “aic_cnfg” 
wrt_cnfg: 

STM #aic_reg_tble,AIC_REG _P ; init AR1 

RPT #AIC_REG_END_LIST-AIC_REG_START_LIST 

MVPD #AIC_REG_START_LIST, *AIC_REG_P+ 7 move the table 

STM #aic_reg_tble,AIC_REG P ; init AR1 

STM #K_REG_0,DXR1 ; primary data word -— 

7 a jump start! 
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PRELIMINARY 


Application Code 


Example 10-9. ‘AC01 Register Configuration (Continued) 


wait_xrdy 


BITF SPC1,K_XRDY 

BC wait_xrdy,NTC 

STM #K_SCND_CONTRL, DXR1 

LD *AIC_REG_P+,A 

STLM A,BRC 

NOP 

RPTB aic_cfg_complte-1 
wait_xrdyl 

BITF SPC1,K_XRDY 

BC wait_xrdy1,NTC 

LD *AIC_REG_ P+,A 

STLM A, DXR1 
wait_xrdy2 

BITF SPC1,K_XRDY 

BC wait_xrdy2,NTC 

STM #K_SCND_CONTRL, DXR1 


aic_cfg_complte 
wait_xrdy3 


BITF SPC1,K_XRDY 
BC wait_xrdy3,NTC 
RE 

PRELIMINARY 


test XRDY bit in SPC1 

loop if not set 

send primary word with 

DO1-D0O0 = 11 to 

signify secondary communication 


gives the # of registers to be 
initialized 


test XRDY bit in SPC1l 
loop if not set 
Read the register contents 


test XRDY bit in SPC1l 

loop if not set 

set to read the next register 
contents 


test XRDY bit in SPC1l 
loop if not set 
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Example 10-10. Receive Interrupt Service Routine 


; TEXAS INSTRUMENTS INCORPORATED 

; DSP Data Communication System Development / ASP 
, 

; Archives: PVCS 

; Filename: rev_intl.asm 

; Version: Lx 

; Status draft ( } 

; proposal (X) 

7 accepted ( ) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 


i{ 
; IPR st 
i} 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


atements description (can be collected). 


PRELIMINARY 


7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i{ 


; Change 
, 

v 

¥ 

, 

i} 

i{ 

7 diss 
r 

; dice al 
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history: 
VERSION DATE / AUTHORS COMM! 
10 July-29-96 / P.Mallela original 
ABSTRACT 
Function Type 


a.Core Routine 
b. Subroutine 


Functional Description 


ENT 
created 


This file contains interrput service routine INT1: 


receive_intl 
Specification/Design Reference (optional) 


Module Test Document Reference 
Not done 


Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s rcev_intl.asm 


Notes and Special Considerations 


PRELIMINARY 


PRELIMINARY 


Example 10-10. Receive Interrupt Service Routine (Continued) 


i} 
r{ 


a 


VOCABULARY 


2.1 Definition of Special Words, Keywords (optional) 


2.2 Local Compiler Flags 


2.3 Local Constants 


3 EXTERNAL RESOURCES 


3'6 


3. 


Bs 


1 Include Files 


-mmregs 
-include “INTERRPT.INC” 
- include "main.inc” 

2 External Data 
.ref d_frame_flag 
.ref d_index_count 
.ref d_rcv_in_ptr,d_xmt_out_ptr 
.cef RCV_INT1_DP 


3 Import Functions 


ERNAL RESOURCES 


Local Static Data 


Global Static Data 


Dynamic Data 


Temporary Data 
Export Functions 
.deft receive_intl 


SUBROUTINE CODE 
HeaderBegin 


Application Code 


PRELIMINARY 


5.1 receive_intl 


5.2 Functional Description 
This routine services receive interruptl. Accumulator A, AR2 and AR3 

are pushed onto the stack since AR2 and AR3 are used in other 
applications. A 512 buffer size of both input and output uses circular 
addressing. After every 256 collection of input samples a flag is set to 
process the data. A PING/PONG buffering scheme is used such that upon 
processing PING buffer, samples are collected in the PONG buffer and vice 


versa. 
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Example 10-10. Receive Interrupt Service Routine (Continued) 


; 5.3 Activation 

: Activation example: 
; BD receive_intl 
i PSHM STO 

; PSHM ST1 

, 

; Reentrancy: No 

7 Recursive : No 

r 

; 5.4 Inputs 

; NONE 

, 

; 5.5 Outputs 

7 NONE 


; 5.6 Global 


; Data structure: 
; Data Format: 

: Modified: 

; Description: 

; 

; 

; 

H Data structure: 
; Data Format: 

: Modified: 

. Description: 

; 

; 

; 

; Data Structure: 
; Data Format: 

- Modified: 

; Description: 

; 

; 

; Data structure: 
; Data Format: 

. Modified: 

: Description: 

; 

; Data structure: 
7 Data Format: 

: Modified: 

. Description: 

; 

; 

; Data structure: 
; Data Format: 

; Modified: 

: Description: 

, 
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AR2 

16-bit input buffer pointer 

Yes 

either point to PING/PONG buffer. Upon entering 

AR2 is pushed onto stack and the address is restored 
through d_rcev_in_ptr 


AR3 

16-bit output buffer pointer 

Yes 

either point to PING/PONG buffer. Upon entering 

AR3 is pushed onto stack and the address is restored 
through d_rcev_in_ptr 


d_index_count 

16-bit var 

Yes 

holds the number samples that has been collected from 
ACOl 


d_frame_flag 

16-bit variable 

Yes 

flag is set if 256 samples are collected 


d_rcv_in_ptr 

16-bit var 

Yes 

holds the input buffer address where the newest 
sample is stored 


d_xmt_out_ptr 

16-bit variable 

Yes 

holds the output buffer address where the oldest 
sample is sent as output 
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PRELIMINARY Application Code 


Example 10-10. Receive Interrupt Service Routine (Continued) 


; 5.7 Special considerations for data structure 
’ 
A 5.8 Entry and Exit conditions 
¥ 
; Tak sk |G sad ubsibau sg is aceliS(isad  aa (? iia sai 
; | | } | | | | | 
jin |u | 1 | 1 |NU | 1 "? oe | NU ° U a ‘ai NU a ” iw “ee faa 
£ 
;out|u | 1 | 1 [wu | 1 [Nu [nu [nu |u |v [Nu |Nu |NU [NU |U |NU|U |NU |NU|NU 
rs 
; Note : UM - Used & Modified, U Used, NU Not Used 
F 
; 5.9 Execution 
; Execution time: ?cycles 
7 Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
; 5.10 Code 
.asg AR2,GETFRM_IN_P ; get frame input data pointer 
.asg AR3, GETFRM_OUT_P 7 get frame output data pointer 
-asg AR2, SAVE_RSTORE_AR2 
-asg AR3, SAVE_RSTORE_AR3 
-sect "main_prg” 
receive_intl: 
PSHM AL 
PSHM AH 
PSHM AG 
PSHM SAVE_RSTORE_AR2 
PSHM SAVE_RSTORE_AR3 
PSHM BK 
PSHM BRC 
STM #2*K_ FRAME SIZE, BK ; circular buffr size of in,out 
; arrays 
LD #RCV_INT1_DP, DP 7; init. DP 
MVDK d_rcv_in_ptr, GETFRM_IN_P ; restore input circular bffr ptr 
MVDK d_xmt_out_ptr,GETFRM_OUT_P ; restore output circular bffr ptr 
ADDM #1,d_index_count ; increment the index count 
LD #K_FRAME_SIZE,A 
SUB d_index_count, A 
BC get_samples, AGT ;check for a frame of samples 
frame_flag_set 
ST #K_FRAME_FLAG,d_frame_flag ; set frame flag 
ST #0,d_index_count ; reset the counter 


PRELIMINARY Application Code Examples 10-49 


Application Code 


Example 10-10. 


get_samples 
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LDM 
STL 
LD 

AND 


PRELIMINARY 


Receive Interrupt Service Routine (Continued) 


DRR1,A 

A, *GETFRM_IN_P+% 
*GETFRM_OUT_P+%,A 
#0fffch,A 

A,DXR1 
GETFRM_IN_P,d_rcv_in_ptr 
GETFRM_OUT_P, d_xmt_out_ptr 
BRC 


RSTORE_AR3 
RSTORE_AR2 


; load the input sample 

; write to buffer 

; if not true, then the filtered 
; Signal is send as output 

; write to DXR1L 

; save input circualr buffer ptr 
7 save out circular bffr ptr 


; return and enable interrupts 
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Example 10-11. = Task Scheduling 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: task.asm 

; Version: 1.0 

; Status +: draft 

; proposal (X) 

; accepted (_) dd-mm-yy/?acceptor. 
Tv 

; AUTHOR Padma P. Mallela 

’ 

; Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


i { 

£ 

; IPR statements description (can be collected). 

rp 

ti 

7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i { 
Fd 
; Change history: 


H VERSION DATE 4 AUTHORS COMMENT 
; Ls: July-29-96 / P.Mallela original created 
, 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 

7 a.Core Routine 

: b.Subroutine 

, 

7 1.2 Functional Description 

7 This file initiates task scheduling 

, 

, 

: 1.3 Specification/Design Reference (optional) 
, 

; 1.4 Module Test Document Reference 

; Not done 

, 

; 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

7 Activation: asm500 -s task.asm 

, 

; 1.6 Notes and Special Considerations 


; This code is written for 541 device. The code is tested on C54x EVM 
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Example 10-11. Task Scheduling (Continued) 


i} 
i { 


, 


7 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
¥ 
; 2.2 Local Compiler Flags 
, 
; 2.3 Local Constants 
; = 
i} 
i{ 
H 3. EXTERNAL RESOURCES 
¥ 
; 3.1 Include Files 
. include “init_54x.inc” 
. include "main.inc” 
; 3.2 External Data 
.ref d_task_addr, d_task_init_addr 
.ref d_buffer_count 
.ref present_command 
.cef last_command 
.ref d_output_addr, d_input_addr 
.ref input_data, output_data 
.ref d_frame_flag 
.ref task_init_list,task_list 
; 3.3 Import Functions 
.ref echo_task 
.cef fir_init,fir_task 
.ref do_nothing, no_echo_init_task 
.cef fir_init,fir_task 
.cef iir_init,iir_task 
.ref sym_fir_task,sym_fir_init 
.ref adapt_init, adapt_task 
.ref rfft_task 
i} 
i{ 
; 4. INTERNAL RESOURCES 
, 
7 4.1 Local Static Data 
rd 
7 4.2 Global Static Data 
, 
A 4.3 Dynamic Data 
¥ 
; 4.4 Temporary Data 
, 
7 4.5 Export Functions 
.def task_handler 


10-52 PRELIMINARY 


PRELIMINARY Application Code 


Example 10-11. Task Scheduling (Continued) 


; 5. SUBROUTINE CODE 

7 HeaderBegin 

, 

3 

j ---------------------------------------------~-------~------------~~~--------+ 
. 5.1 task_handler 

ta 

; 5.2 Functional Description 

; This routine handles the task scheduling. The present_command 
; can take values 1,2,3,4,5,6. If 

A present_command = 1 - Echo program is enabled 

; present_command = 2 - FIR is enabled 

; present_command = 3 - IIR is enabled 

: present_command = 4 - Symmetric FIR is enabled 

; present_command = 5 - Adaptive fitler is enabled 

; present_command = 6 - FFT is enabled 

; For every cycle the program checks if the current task is same 
7 as previous task if it is, then no initialization is done. 
: If its not then cirular buffers, variables pointers are intialized 
; depending upon the task. 

joes See ee ee ee 
, 

; 5.3 Activation 

; Activation example: 

; CALL task_handler 

; Reentrancy: No 

: Recursive : No 

, 

: 5.4 Inputs 

ta 

; Data structure: present_command 

; Data Format: 16-bit variable 

; Modified: Yes 

7 Description: holds the present command 

v 

7 5.5 Outputs 

¥ 

‘ Data structure: AR6 

7 Data Format: 16-bit input buffer pointer 

7 Modified: Yes 

; Description: either point to PING/PONG buffer 

, 

: Data structure: AR7 

7 Data Format: 16-bit output buffer pointer 

; Modified: Yes 

; Description: either point to PING/PONG buffer 

, 

. 5.6 Global 

’ 

- Data structure: last_command 

; Data Format: 16-bit variable 

; Modified: Yes 

; Description: holds the last command 
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Example 10-11. 


Data structure: 
Data Format: 
Modified: 
Description: 


Data structures: 
Data Format: 
Modified: 
Description: 


Data structures 
Data Format: 
Modified: 
Description: 


Data structures 
Data Format: 
Modified: 
Description: 


Data structure: 
Data Format: 
Modified: 
Description: 


Data structure: 
Data Format: 
Modified: 
Description: 


Data structure: 
Data Format: 
Modified: 
Description: 


Data structure: 
Data Format: 


PRELIMINARY 


Task Scheduling (Continued) 


d_frame_flag 

16-bit variable 

Yes 

gets reset after 256 samples 


d_buffer_count 

16-bit variable 

Yes 

used to load either PING/PONG bffr addresses 


input_data 
16-bit array 
Yes 


address of the input data buffer 


output_data 

16-bit array 

Yes 

address of the output data buffer 


d_input_addr 

16-bit variable 

Yes 

holds either PING/PONG address 


d_output_addr 

16-bit variable 

Yes 

holds either PING/PONG address 


d_task_addr 

16-bit variable 

Yes 

holds the task program address 


d_task_init_addr 
16-bit variable 


; Modified: Yes 

7 Description: holds the task init. address 

5.7 Special considerations for data structure 

: 5.8 Entry and Exit conditions 

|DP | OVM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 


jin |U 
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1 | 1 [Nu | 1 


Etats 1 | 1 [Nu | 1 Res ia is nn ' ce e ls a UM 


| 
[Nu |NU |NU |NU [NU [NU |NU [NU |U_ [NU 

| 

| 


|BK|BRC| T| 

| | | 
|NU|NU|NU |NU|NU 
ola a fo 


NU|NU 


UM - Used & Modified, U - Used, NU - Not Used 
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PRELIMINARY 


Example 10-11. Task Scheduling (Continued) 


; 5.9 Execution 
; Execution time: ?cycles 
: Call rate: not applicable for this 


Application Code 


application 


; HeaderEnd 


; 5.10 Code 
.asg AR6, INBUF_P ; 
.asg AR7, OUTBUF_P ; 
-sect "task_hnd” 


task_handler: 


; LD #TASK_ VAR_DP,DP 
ST #K_0,d_frame_flag ; 
ADDM #1,d_buffer_count 
LD #input_data,A ; 
LD #output_data,B ; 
BITF d_buffer_count,2 ; 
BC reset_buffer_address, NTC ; 
ADD #K_FRAME SIZE,A 7 
ADD #K_FRAME SIZE,B 7 
ST #K_0,d_buffer_count : 
reset_buffer_address 
STLM A, INBUF_P : 
STi A, d_input_addr ; 
STLM B, OUTBUF_P : 
STL B, d_output_addr ; 
LD present_command,A 
SUB last_command,A 
BC new_task, ANEQ ; 
LD d_task_addr,A Hi 
v 
r 
CALA A ; 
RET 
new_task: 
MVKD present_command,last_command j; 
LD #task_init_list,A : 
ADD present_command,A : 
REDA d_task_init_addr ; 
LD d_task_init_addr,A 
CALA A ; 
LD #task_list,A; 
ADD present_command,A H 
READA d_task_addr ; 
v 
LD d_task_addr,A 
CALA A 
RE 
do_nothing: 
RE 
no_echo_init_task: 
RE 
.end 
PRELIMINARY 


PING/PONG input buffer 
PING/PONG output buffer 


reset the frame flag 


load PING input address 
load PING output address 
check if PING/PONG address 
needs to be loaded 

PONG input address 

PONG output address 

reset counter 


input buffer address 
restore either PING/PONG bffr 
output buffer address 
restore either PING/PONG bffr 


check if PC = LC 

task_addr should 

contain previous 

PC = LC 

call present task 


restore the present command 
loads PC init task 

computes the present task 
save the PC into task_addr 


initializes the present task 
computes the present task 


save the PC into 
task_addr 
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Example 10-12. — Echo the Input Signal 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: echo.asm 

; Version: LO) 

; Status : draft 

7 proposal (X) 

7 accepted ( ) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

v 

. Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

7 Stafford, TX 77477 


; IPR statements description (can be collected). 
7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


; VERSION DATE / AUTHORS COMMENT 
7 1.10 July-24-96 / P.Mallela original created 
, 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

; b. Subroutine 

, 

; 1.2 Functional Description 

A This file contains one subroutines: 

; echo_task 

7 1.3 Specification/Design Reference (optional) 

7 called by task.asm depending upon the task 
, 

H 1.4 Module Test Document Reference 

; Not done 

, 

; 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

7 Activation: asm500 -s echo.asm 

, 

7 1.6 Notes and Special Considerations 

; = 

i} 
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Example 10-12. Echo the Input Signal (Continued) 


Application Code 
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7 2. VOCABULARY 
; 
: 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; = 
‘ 2.3 Local Constants 
; _ 
i} 
i{ 
H 3. EXTERNAL RESOURCES 
, 
7 3.1 Include Files 
-mmregs 
- include "main.inc” 
. 3.2 External Data 
; 3.3 Import Functions 
i} 
i{ 
} 4. INTERNAL RESOURCES 
, 
; 4.1 Local Static Data 
; _ 
é 4.2 Global Static Data 
; Pa 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; _ 
: 4.5 Export Functions 
.def echo_task 
i} 
; 5. SUBROUTINE CODE 
: HeaderBegin 
, 
, 
j --------------------------------------------~+-----+ ~~~ + ~~~ ~~ =~ ~~~ ~~~ ~~~ +--+ 
; 5.1 echo_task 
, 
: 5.2 Functional Description 
; This function reads a sample from either PING/PONG buffer and puts it 
7 back in the output buffer. This is repeated 256 times i.e., size of the 
7 frame. The present command in this case is 1. 
pea eo a a a a a a 
; 
; 5.3 Activation 
: Activation example: 
: CALL echo_task 
; Reentrancy: No 
; Recursive : No 
, 
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Example 10-12. Echo the Input Signal (Continued) 


PRELIMINARY 


; 5.4 Inputs 

, 

7 Data structure: AR6 

; Data Format: 16-bit input buffer pointer 

H Modified: Yes 

; Description: either point to PING/PONG buffer 

, 

; 5.5 Outputs 

, 

i Data structure: AR7 

; Data Format: 16-bit output buffer pointer 

; Modified: Yes 

' Description: either point to PING/PONG buffer 

, 

; 5.6 Global 

, 

, 

7 5.7 Special considerations for data structure 

; 2 

7 5.8 Entry and Exit conditions 

; 

: |DP | OvM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5S|AR6|AR7|A |B |BK|BRC| T|TRN| 
| | ee eee 
jin F 1 | 1 [nu | 1 |NuU |NU |NU ’ [Nu [NU |NU JU |U eee |NU| NU | 
| 

;out|U | 1 | 1 [Nu | 1 [NU [NU |NU [NU |NU [NU |NU lai e eee ase | 
; 

; Note : UM - Used & Modified, U - Used, NU - Not Used 

; 

7 5.9 Execution 

: Execution time: ?cycles 

; Call rate: not applicable for this application 


ror 


; HeaderEnd 


; 5.10 Code 
.asg AR6, INBUF_P ; 
-asg AR7, OUTBUF_P 7 
-sect “echo_prg” 

echo_task: 
STM #K_FRAME_SIZE-1,BRC ; 
RPTB echo_loop-1 
LD *INBUF_P+, A ; 
STL A, *OUTBUF_P+ 

echo_loop 
RET ; 
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PING/PONG input buffer 
PING/PONG output buffer 


frame size of 256 


load the input value 


output buffer 
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Example 10-13. | Low-Pass FIR Filtering Using MAC Instruction 


; TEXAS INSTRUMENTS INCORPORATED 

; DSP Data Communication System Development / ASP 

r 

; Archives: PVCS 

; Filename: fir.asm 

; Version: 1,'0 

; Status draft 

; proposal (X) 

; accepted (_) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

, 

; Application Specific Products 

: Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 

i { 

£ 

; IPR statements description (can be collected). 

i} 

ti 

7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


if 


Change 


Lol 


history: 
VERSION DATE i AUTHORS COMMENT 
1.0 July-26-96 / P.Mallela original created 
ABSTRACT 
Function Type 
a.Core Routine 
b. Subroutine 
Functional Description 
This file contains two subroutines: 
1) fir_init 
2) fir_task 
Specification/Design Reference (optional) 
called by task.asm depending upon the task thru CALA 
Module Test Document Reference 
Not done 
Compilation Information 
Compiler: TMS320C54X ASSEMBLER 
Version: 1.02. (PC) 
Activation: asm500 -s fir.asm 


Notes and Special Considerations 
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Example 10-13. Low-Pass FIR Filtering Using MAC Instruction (Continued) 


; 2. VOCABULARY 


; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; a 
; 2.3 Local Constants 
; = 
i} 
i{ 
; 3. EXTERNAL RESOURCES 
r 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
; 3.2 External Data 
.ref d_filin , filter input 
.ref ad_fi lowe , filter output 
.ref d_data_buffer 
.cef fir_task 
.ref COFF_FIR_START, COFF_FIR_END 
.ref fir_coff_table 
; 3.3 Import Functions 
i} 
i{ 
; 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; ee 
7 4.5 Export Functions 
.def fir_init ; initialize FIR filter 
.def fir_filter ; perform FIR filtering 
i} 
r 5. SUBROUTINE CODE 
; HeaderBegin 
r 
r 
j ------------------- 7 == eee ass ae a = 
; Sel ELE anit 
, 
7 5.1.1 Functional Description 
; This routine initializes cicular buffers both for data and coeffs. 
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Example 10-13. Low-Pass FIR Filtering Using MAC Instruction (Continued) 


; 5.1.2 Activation 
: Activation example: 
; CALL fir init 
; Reentrancy: No 
7 Recursive No 
, 
: 5.1.3 Inputs 
; NONE 
; 5.1.4 Outputs 
; NONE 
, 
; 5.1.5 Global 
, 
; Data structure: ARO 
7 Data Format: 16-bit index pointer 
; Modified: No 
: Description: uses in circular addressing mode for indexing 
, 
: Data structure: AR4 
; Data Format: 16-bit x(n) data buffer pointer 
; Modified: Yes 
; Description: initializes the pointer 
, 
; Data structure: AR5 
; Data Format: 16-bit w(n) coeff buffer pointer 
; Modified: Yes 
; Description: initializes the pointer 
, 
- 5.1.6 Special considerations for data structure 
; = 
; 5.1.7 Entry and Exit conditions 
; 
: | DP | OVM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
; | | | | | 
jin ju | 1 | al ae | 1 = |NU [NU S | NU A hes is es e aoe | NU | NU 
- | | | } | | | || 
sout|u | 1 | 1 [Nu | 1 [NU |UM |NU |NU wo |UM |UM ' |NU |UM|NU|NU|NU_ |NU|NU 
, 
; Note UM - Used & Modified, U - Used, NU - Not Used 
, 
; 5.1.8 Execution 
: Execution time: ?cycles 
; Call rate: not applicable for this application 
ro 
; HeaderEnd 
; 5.1.9 Code 
-asg ARO, FIR_INDEX_P 
-asg AR4,FIR_DATA_P 
-asg AR5,FIR_COFF_P 
-sect "fir _prog” 
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Example 10-13. Low-Pass FIR Filtering Using MAC Instruction (Continued) 


fir Tn aes 


STM #fir_coff_table, FIR_COFF_P 
RPT #K_FIR_BFFR-1 , move FIR coeffs from program 
MVPD #COFF_FIR_START, *FIR_COFF_P+ ; to data 
STM #K_FIR_INDEX, FIR_INDEX_P 
STM #d_data_buffer,FIR_DATA P ; load cir_bfr address for the 
; recent samples 
RPTZ A, #K_FIR_BFFR 
STL A, *FIR_DATA_ P+ ; reset the buffer 
STM # (d_data_buffer+K_FIR_BFFR-1), FIR_DATA_P 
RETD 
STM #fir_coff_table, FIR_COFF_P 
; 5. SUBROUTINE CODE 
; HeaderBegin 
, 
rd 
j -------------------------------------~----~~+~----~ ~~~ ~~~ ~~~ ~~~ ~~~ +--+ ++ 
: 5.2 fir_task 
¥ 
; 5.2.1 Functional Description 
v 
; This subroutine performs FIR filtering using MAC instruction. 
; accumulator A (filter output) = h(n)*x(n-i) for i = 0,1...15 
j--------- peaeee ane pa ot i sn Aa Eterceneuees ee pepe pe Steepee ene aN 
, 
; 5.2.2 Activation 
; Activation example: 
; CALL fir_task 
; Reentrancy: No 
7 Recursive : No 
, 
; 5.2.3 Inputs 
, 
: Data structure: AR6 
; Data Format: 16-bit input buffer pointer 
A Modified: Yes 
‘ Description: either point to PING/PONG buffer 
, 
; Data structure: AR4 
; Data Format: 16-bit data buffer pointer 
H Modified: Yes 
; Description: uses circular buffer addressing mode to filter 
; 16 tap Low-Pass filter - init. in fir_init 
, 
; Data structure: AR5 
7 Data Format: 16-bit coefficient buffer pointer 
: Modified: Yes 
7 Description: The 16 tap coeffs comprise the low-pass filter 
: init. in fir_init 
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Example 10-13. Low-Pass FIR Filtering Using MAC Instruction (Continued) 


; 5.2.4 Outputs 

, 

; Data structure: AR7 

7 Data Format: 16-bit output buffer pointer 

; Modified: Yes 

; Description: either point to PING/PONG buffer 

, 

, 

; 5.2.5 Global 

; NONE 

, 

; 5.2.6 Special considerations for data structure 

; _ 

7 5.2.7 Entry and Exit conditions 

; 

; 

7 | DP | OVM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
> | | | | | | | | | | | 
jin ju | 1 | 1 [nu | 1. [Nu [Nu [NU [NU [NU |U [NU |U | U |UM|NU|NU|NU |NU|NU | 
: | 

pout |U | 1 | 1 eG | Hl i | UM NU |NU |NU |UM |UM |UM |UM |UM/NU|UM/UM |NU|NU 


; Note : UM - Used & Modified, U Used, NU Not Used 


; 5.2.8 Execution 
; Execution time: ?cycles 
; Call rate: not applicable for this application 


ror 
; HeaderEnd 


; 5.2.9 Code 
.asg AR6, INBUF_P 
.asg AR7, OUTBUF_P 
-asg AR4,FIR_DATA _P 
-asg AR5,FIR_COFF_P 
-sect "fir _prog” 
fir_task: 
; LD #FIR_DP, DP 
STM #K_FRAME SIZE-1,BRC ; Repeat 256 times 
RPTBD fir_filter_loop-1 
STM #K_FIR_BFFR, BK ; FIR circular bffr size 
LD *INBUF_P+, A ; load the input value 
fir filter 
STL A, *FIR_DATA_P+% ; replace oldest sample with newest 
7 sample 
RPTZ A, (K_FIR_BFFR-1) 
MAC *FIR_DATA _P+0%,*FIR_COFF_P+03%,A ; filtering 
STH A, *OUTBUF_P+ ; replace the oldest bffr value 
fir_filter_loop 
RET 
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10-14. | Low-Pass Symmetric FIR Filtering Using FIRS Instruction 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: sym_fir.asm 

; Version: 1.0 

3; Status draft () 

: proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
sf 

; AUTHOR Padma P. Mallela 


if 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


; IPR statements description (can be collected). 


i} 


7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i{ 


; Change history: 

; 

; VERSION DATE / AUTHORS COMMENT 

7 1.0 July-26-96 / P.Mallela original created 
, 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 


a.Core Routine 
b. Subroutine 


Functional Description 

This file contains two subroutines: 
1) sym_fir_init 

2) sym_fir_task 


Specification/Design Reference (optional) 
called by task.asm depending upon the task thru CALA 


Module Test Document Reference 
Not done 


Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s sym_fir.asm 


Notes and Special Considerations 
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Example 10-14. Low-Pass Symmetric FIR Filtering Using FIRS Instruction (Continued) 


; 2. VOCABULARY 
; 
A 2.1 Definition of Special Words, Keywords (optional) 
; = 
} 2.2 Local Compiler Flags 
; _ 
: 2.3 Local Constants 
; = 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
7 3.2 External Data 
.ref d_datax_buffer 
.ref d_datay_buffer 
.ref FIR_COFF 
; 3.3 Import Functions 
i} 
i{ 
: 4. INTERNAL RESOURCES 
, 
: 4.1 Local Static Data 
; = 
7 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; - 
: 4.5 Export Functions 
.def sym_fir_init ; initialize symmetric FIR 
.def sym_fir_task 
i} 
; 5. SUBROUTINE CODE 
: HeaderBegin 
, 
, 
SS aS ae ee 8 oe ee 
A Sul sym_fir_init 
, 
7 5.1.1 Functional Description 
: This routine initializes cicular buffers both for data and coeffs. 
ee ee 
, 
: 5.1.2 Activation 
; Activation example: 
; CALL sym_fir_init 
; Reéntrancy: No 
A Recursive : No 
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Example 10-14. Low-Pass Symmetric FIR Filtering Using FIRS Instruction (Continued) 


; Si1.3 Inputs 
; NONE 
; 5.1.4 Outputs 
; NONE 
, 
7 5.1.5 Global 
, 
7 Data structure: ARO 
7 Data Format: 16-bit index pointer 
; Modified: No 
; Description: uses in circular addressing mode for indexing 
, 
7 Data structure: AR4 
7 Data Format: 16-bit x(n) data buffer pointer for 8 latest samples 
H Modified: Yes 
7 Description: initializes the pointer 
, 
; Data structure: AR5 
; Data Format: 16-bit x(n) data buffer pointer for 8 oldest samples 
; Modified: Yes 
7 Description: initializes the pointer 
, 
: 5.1.6 Special considerations for data structure 
; - 
- 5.1.7 Entry and Exit conditions 
, 
; |DP |ovm|SxM|c16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 
z. || | | ee eee oe ; | | | of | 
;in Ju | 1 | 1 [nu | 1) [nu [NU |NU [NU |NU [NU |NU [NU [NU |UM|NU|NU|NU |NU|NU 
; Poe ee ee 
;out|u | 1 | 1 [Nu | 1) [Nu [uM [Nu [NU [NU |U [UM [NU |NU |UM|NU|NU|NU |NU|NU 
, 
; Note UM - Used & Modified, U - Used, NU - Not Used 
, 
; 5.1.8 Execution 
. Execution time: ?cycles 
; Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
; 5.1.9 Code 

-asg ARO, SYMFTR_INDEX_P 

-asg AR4,SYMFIR_DATX_P 

-asg AR5,SYMFIR_DATY_P 

-sect “sym_fir” 
sym_fir_init: 

STM #d_datax_buffer, SYMFIR_DATX_P ; load cir_bfr address 

; for the 8 most 
; recent samples 
STM #d_datay_buffer+K_FIR_BFFR/2-1, SYMFIR_DATY_P 
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Example 10-14. Low-Pass Symmetric FIR Filtering Using FIRS Instruction (Continued) 


STM #K_negl, SYMFIR_INDEX_P index offset - 
whenever the pointer 
hits the top of the bffr, 
it automatically hits 
bottom address of 
buffer and decrements 
the counter 

RPTZ A, #K_FIR_BFFR 

STL A,* SYMFIR_DATX_P+ 

STM #d_datax_buffer, SYMFIR_DATX_P 

RPTZ A, #K_FIR_BFFR 

STL A,* SYMFIR_DATY_P- 

RETD 

STM #d_datay_buffer+K_FIR_BFFR/2-1, SYMFIR_DATY_P 

; 5. SUBROUTINE CODE 

7 HeaderBegin 

¥ 

F 

a 
; 5.2 -Sym_fir_init 

, 

; 5.2.1 Functional Description 

; This program uses the FIRS instruction to implement symmetric FIR 

; filter. Circular addressing is used for data buffers.The input scheme 
; for the data samples is divided into two circular buffers. The first 
; buffer contains samples from X(-N/2) to X(-1) and the second buffer 
; contains samples from X(-N) to X(-N/2-1). 

fe a a 
; 

; 5.2.2 Activation 

; Activation example: 

: CALL sym_fir_init 

; Reentrancy: No 

; Recursive No 

, 

7 D223 Inputs 

¥ 

; Data structure: AR6 

7 Data Format: 16-bit input buffer pointer 

: Modified: Yes 

; Description: either point to PING/PONG buffer 

v 

; Data structure: AR4 

: Data Format: 16-bit data buffer pointer 

; Modified: Yes 

; Description: uses circular buffer addressing mode to filter 

; 16 tap Low-Pass filter - init. in sym_fir_init 

v 

; Data structure: AR5 

7 Data Format: 16-bit coefficient buffer pointer 

; Modified: Yes 

7 Description: The 16 tap coeffs comprise the low-pass filter 
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init. in sym_fir_init 
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Example 10-14. Low-Pass Symmetric FIR Filtering Using FIRS Instruction (Continued) 


7 5.2.4 Outputs 
, 
; Data structure: AR7 
; Data Format: 16-bit output buffer pointer 
; Modified: Yes 
; Description: either point to PING/PONG buffer 
¥ 
7 5.2.5 Global 
; NONE 
, 
; 5.2.6 Special considerations for data structure 
¥ 
; 5.2.7 Entry and Exit conditions 
rd 
; | DP | OvM| SxXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 
> | | | Lo ee ee ee 
jin i | 1 | 1 |NU | 1 [nu [nu [nu [Nu [nu [vu |[uU |U | U as fis hs ies 
, 
yout ju | 1 {21 [nu | 1 [nu [um [Nu [NU [NU |UM |UM [NU |NU |UM|UM|NU|UM |NU|NU 
, 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
r 
; 5.2.8 Execution 
. Execution time: ?cycles 
; Call rate: not applicable for this application 
, 
r 
; HeaderEnd 
; 5.2.9 Code 
-asg AR6, INBUF_P 
.asg AR7, OUTBUF_P 
-asg AR4, SYMFTR_DATX_P 
-asg AR5,SYMFTIR_DATY_P 
-sect "sym_fir” 
sym_fir_task: 
STM #K_FRAME_SIZE-1,BRC 
RPTBD sym_fir_filter_loop-1 
STM #K_FIR_BFFR/2,BK 
LD *INBUF_P+, B 
symmetric_fir: 
MVDD *SYMFIR_DATX_P, *SYMFIR_DATY_P+0% ; move X(-N/2) to X(-N) 
STL B, *SYMFIR_DATX_P ; replace oldest sample with newest 
; sample 
ADD *SYMFIR_DATX_P+0%, *SYMFIR_DATY_P+0%,A ; add X(0)+X(-N/2-1) 
RPTZ B,#(K_FIR_BFFR/2-1) 
FIRS *SYMFIR_DATX_P+0%, *SYMFTR_DATY_P+0%,FIR_COFF 
MAR *+SYMFIR_DATX_P(2)% ; to load the next newest sample 
MAR *SYMFIR_DATY_P+% ; position for the X(-N/2) sample 
STH B, *OUTBUF_P+ 


sym_fir_filter_lo 
RET 
.end 
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Example 10-15. | Low-Pass Biquad IIR Filter 


ee 


EXAS INSTRUMENTS INCORPORATED 


; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: iir.asm 

; Version: 1/0 

7; Status : draft () 

; proposal (X) 

; accepted (_) dd-mm-yy/?acceptor. 
v 

; AUTHOR: Padma P. Mallela 


if 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


; IPR statements description (can be collected). 


i} 
7(C 


if 


) Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


VERSION DATE / AUTHORS COMMENT 
1.0 July-26-96 / P.Mallela original created 


1. ABSTRACT 


1.1 Function Type 
a.Core Routine 
b.Subroutine 


1.2 Functional Description 
This file contains two subroutines: 
1) Dir ante 
2) iir_task 
1.3 Specification/Design Reference (optional) 
called by task.asm depending upon the task thru CALA 


1.4 Module Test Document Reference 
Not done 


1.5 Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s iir.asm 


1.6 Notes and Special Considerations 
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Example 10-15. Low-Pass Biquad IIR Filter (Continued) 


i 2. VOCABULARY 
r 
; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; a 
; 2.3 Local Constants 
; = 
i} 
i{ 
i 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
-include "main.inc” 
; 3.2 External Data 
.ref d_iir_y 
.ref fo lille alo | 
.ref iir_table_start,iir_coff_table 
; 3.3 Import Functions 
i} 
i{ 
; 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; Ps 
; 4.5 Export Functions 
.def iir_init 
.def iir_task 
i} 
; 5. SUBROUTINE CODE 
7 HeaderBegin 
, 
, 
Jee oeee Sao ee ee eee ee Se Se ee ee a 
7 Sel Lie Ine 
r 
e 5.1.1 Functional Description 
; A) This routine initializes buffers both for data and coeffs. 
a ee 
; 
7 5.1.2 Activation 
: Activation example: 
; CALL Tie Init 
; Reentrancy: No 
7 Recursive : No 
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Example 10-15. Low-Pass Biquad IIR Filter (Continued) 


7 S13 Inputs 
7 NONE 
: 5.1.4 Outputs 
H NONE 
; 5.1.5 Global 
Hi NONE 
; 5.1.6 Special considerations for data structure 
v 
; 5.1.7 Entry and Exit conditions 
v 
; ee si ie ake GEA csc la uk es . Een sie 
’ 
yin |u | 1 | 1 be | 1 ha bi ie ia ry Pe Pe | NU i ae lied 
ti 
;out|u | 1 | 1 {wu | 1 [Nu [NU |NU [NU |NU |UM |UM |NU |NU |UM|NU|NU|UM |UM|NU 
t 
; Note : UM - Used & Modified, U Used, NU Not Used 
¥ 
; 5.1.8 Execution 
; Execution time: ?cycles 
: Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
: 5.1.9 Code 
-asg AR5,IIR_DATA_P ; data samples pointer 
-asg AR4,IIR_COFF_P ; IIR filter coffs pointer 
-sect "iir” 
iir_init: 
STM #iir_coff_table, IIR_COFF_P 
RPT #K_IITR_SIZE-1 7 move IIR coeffs from program 
MVPD #iir_table_start, *IIR_COFF_P+t+ ; to data 
LD #IIR_DP,DP 
STM #d_iir_d, IIR_DATA_P ;AR5:d(n),d(n-1),d(n-2) 
RPTZ A,#5 jyinitial d(n),d(n-1),d(n-2)=0 
STL A, *IIR_DATA_P+ 
RET 
; 5. SUBROUTINE CODE 
7 HeaderBegin 
' 
, 
p------ = eee ares goo hn = ae oa aera 
, 5.2 iir_task 
v 
; 5.2.1 Functional Description 
tv 
A This subroutine performs IIR filtering using biquad sections 
; IIR Low pass filter design 
7 Filter type : Elliptic Filter 
; Filter order : 4 order (cascade: 2nd order + 2nd order) 
; cut freq. of pass band : 200 Hz 
; cut freq. of stop band : 500 Hz 
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Example 10-15. Low-Pass Biquad IIR Filter (Continued) 


; BO 

P eye ====S= > + ------ > din) === -=>> 4 Se = 

; | | | 

; | Al | Bl | 

7 + <- x -- d(n-1) = 2 => + 

, 

i A2 B2 | 

7 + <- x d(n-2) -- x -> + 

, 

7 second order IIR 

; —— 
, 

; 5.2.2 Activation 

: Activation example: 

. CALL iir_task 

; Reentrancy: No 

7 Recursive : No 

, 

; 5.2.3 Inputs 

, 

; Data structure: AR6 

7 Data Format: 16-bit input buffer pointer 

; Modified: Yes 

; Description: either point to PING/PONG buffer 
rd 

7 5.2.4 Outputs 

, 

; Data structure: AR7 

; Data Format: 16-bit output buffer pointer 

; Modified: Yes 

7 Description: either point to PING/PONG buffer 
, 

; 5.2.5 Global 

; Data structure: AR1 

; Data Format: 16-bit index counter 

; Modified: Yes 

; Description: checks if 256 samples are processed 
, 

7 Data structure: AR5 

7 Data Format: 16-bit data buffer pointer 

; Modified: Yes 

7 Description: inlcudes both feed forward and feedback paths 
rg 

; Data structure: AR4 

7 Data Format: 16-bit coefficient buffer pointer 
; Modified: Yes 

; Description: contains 2 biquad sections 

4 

; Data structure: dir y 

: Data Format: 16-bit variable 

; Modified: Yes 

; Description: holds the output of the 2 biquad sections 
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Example 10-15. Low-Pass Biquad IIR Filter (Continued) 


; 5.2.6 Special considerations for data structure 
, 
; 5.2.7 Entry and Exit conditions 
, 
' 
: ia ai Vn aka ica Wa Vo a1 |? ia a ar | 
ta 
jin |u | 1 | 1 ia | 1 im bi se bil ie n° \° ° bass bs lng a el | 
v 
;out|u | 1 | 1 [Nu | 1 [NU [NU |UM [NU |NU |UM |UM |UM |UM |UM|NU|NU|UM |UM|NU | 
f 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
’ 
; 5.2.8 Execution 
; Execution time: ?cycles 
; Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
; 5.2.9 Code 
.asg AR5, IIR_DATA_P ; data samples pointer 
-asg AR4, IIR_COFF_P ; IIR filter coffs pointer 
-asg AR6, INBUF_P 
-asg AR7, OUTBUF_P 
-asg AR1, IIR_INDEX_P 
-sect eae 
iir_task: 
STM #K_FRAME_SIZE-1, BRC ; Perform filtering for 256 samples 
RPTB iir_filter_loop-1 
LD *INBUF_P+,8,A ; load the input value 
iir_filter: 
STM #d_iir_d+5, IIR_DATA_P ;AR5:d(n),d(n-1),d(n-2) 
STM #iir_coff_table, IIR_COFF_P ;AR4:coeff of IIR filter A2,A1,B2,B1,B0 
STM #K_BIQUAD-1, IIR_INDEX_P 
feedback_path: 
MAC *TIR_COFF_P+,*IIR_DATA P-,A ; inputt+d (n-2) *A2 
MAC *TIR_COFF_P,*IIR_DATA _P,A ;inputtd(n-2) *A2+d(n-1) *A1/2 
MAC *TIR_COFF_P+,*IIR_DATA P-,A ; A = Atd(n-1) *A1/2 
STH A, *IIR_DATA P+ ;d(n) = input+d(n-2) *A2+d(n-1) *Al 
MAR *TIR_DATA_P+ 
* Forward path 
MPY *TIR_COFF_P+,*IIR DATA P-,A ;d(n-2) *B2 
MAC *TIR_COFF_P+,*IIR_DATA P,A 3d (n-2) *B2+d(n-1) *B1 
DELAY *IIR_DATA_P- ;d(n-2) =d (n-1) 
eloop: 
BANZD feedback_path, *IIR_INDEX_P- 
MAC *TIR_COFF_P+,*IIR_DATA P,A ;d(n-2) *B2+d (n-1) *B1+d(n) *BO 
DELAY *IIR_DATA_P- 7 d(n-1)=d (n) 
STH A,d_iir_y ; output=d (n-2) *B2+d (n-1) *B1+d(n) *BO 
LD d_iir_y,2,A 7 scale the output 
STL A, *OUTBUF_P+ ; veplace the oldest bffr value 
iir_filter_loop 
RET 
.end 
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Example 10-16. Adaptive Filtering Using LMS Instruction 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: adapt.asm 

; Version: 1.0 

; Status +: draft () 

7 proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
, 

; AUTHOR Padma P. Mallela 

lf 

; Application Specific Products 

: Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


; IPR statements description (can be collected). 


7(C) Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


7 VERSION DATE i AUTHORS COMMENT 
; 1.0 July-24-96 / P.Mallela original created 
r 

i} 

i{ 

; 1. ABSTRACT 

, 

; 1.1 Function Type 

; a.Core Routine 

: b. Subroutine 

, 

7 1.2 Functional Description 

; This file contains two subroutines: 

; 1) adapt_init 

: 2) adapt_task 

: 1.3 Specification/Design Reference (optional) 
; called by task.asm depending upon the task 
ra 

7 1.4 Module Test Document Reference 

: Not done 

rg 

7 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

; Version: 1,02 (PC) 

: Activation: asm500 -s adapt.asm 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


; 1.6 Notes and Special Considerations 
; = 
i} 
i{ 
; 2. VOCABULARY 
; 
; 2.1 Definition of Special Words, Keywords (optional) 
; ~ 
; 2.2 Local Compiler Flags 
; = 
: 2.3 Local Constants 
; _ 
i} 
i{ 
F 3. EXTERNAL RESOURCES 
; 
7 3.1 Include Files 
-mmregs 
- include "main.inc” 
; 3.2 External Data 
Hi .cef ADAPT_DP 
.ref d_mu, d_error,d_primary, d_output, d_mu, d_mu_e, d_new_x 
.ref scoff,hcoff,wcoff 
.ref xh, xw, d_adapt_count 
; 3.3 Import Functions 
i} 
i{ 
i. 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; _ 
. 4.2 Global Static Data 
; _ 
; 4.3 Dynamic Data 
; = 
} 4.4 Temporary Data 
; 7 
; 4.5 Export Functions 
.def adapt_init, adapt_task 
i} 
3 
; 5. SUBROUTINE CODE 
; HeaderBegin 
, 
, 
a ee een mee a eee tere ee eee ee 
; 5.1 adapt_init 
, 
; 5.1.1 Functional Description 
, 
- This subroutine moves filter coeffcients from program to data space. 
; Initializes the adaptive coefficients, buffers,vars,and sets the circular 
: buffer address for processing. 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


Ne Ne Ne Ne Ne Ne 


.1.2 Activation 


Activation example: 


CALL adapt_init 

Reentrancy: No 

Recursive : No 
-1.3 Inputs 

NONE 


-1.4 Outputs 


, 

, 

, 

, 

, 

; NONE 

, 

; 5.1.5 Global 

; Data structure: ARO 

: Data Format: 16-bit index pointer 

: Modified: No 

: Description: uses in circular addressing mode for indexing 

, 

; Data structure: ARI1 

; Data Format: 16-bit pointer 

; Modified: Yes 

; Description: used in initializing buffers and vars 

, 

; Data structure: AR3 

; Data Format: 16-bit x(n) data buffer pointer for H(z) 

: Modified: Yes 

; Description: initializes the pointer 

, 

: Data structure: AR5 

; Data Format: 16-bit x(n) data buffer pointer for W(z) 

H Modified: Yes 

7 Description: initializes the pointer 

, 

7 5.1.6 Special considerations for data structure 

; ~ 

7 5.1.7 Entry and Exit conditions 

; 

i |DP |ovM|SxM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
; | 1 dt Aes eee eee eee ee 
jin ju | 1 | 1 [Nu | 1) |nu [NU [NU [NU [NU [NU |NU |NU |NU |UM|NU|NU|NU |NUI|NU | 
ett fo fp fp 
jout|U | 1 | 1 [Nu | 1 [NU [UM |UM |NU |UM |NU |UM |NU [NU |UM|UM|NU|NU |NU|NU | 
; Note UM - Used & Modified, U - Used, NU - Not Used 

; 

; 5.1.8 Execution 

; Execution time: ?cycles 

; Call rate: not applicable for this application 

, 

, 

; HeaderEnd 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


; 5.1.9 Code 
-asg ARO, INDEX_P 
.asg AR1, INIT_P , initialize buffer pointer 
-asg AR3, XH_DATA_P ; Gata coeff buffer pointer 
-asg AR5, XW_DATA_P ; Gata coeff buffer pointer 


; for cal.y output 
-sect “filter” 


adapt_init: 
. initialize input data location, input to hybrid, with Zero. 
STM #xh, INIT_P 
RPTZ A, #H_FILT_SIZE-1 
STL A, *INIT_P+ 
; initialize input data location, input to adaptive filter, with Zero. 
STM #xw, INIT_P 
RPTZ A, #ADPT_FILT_SIZE-1 
STL A, *INIT_P+ 
7 initialize adaptive coefficient with Zero. 
STM #wcoff, INIT_P 
RPTZ A, #ADPT_FILT_SIZE-1 
STL A, *INIT_Pt+ 
’ initialize temporary storage loacations with zero 
STM #d_primary, INIT_P 
RPTZ A, #6 
STL A, *INIT_Pt 
; copy system coefficient into RAM location, Rverse order 
STM #hcoff, INIT_P 
RPT #H_FILT_SIZE-1 
MVPD #scoff, *INIT_Pt+ 
: LD #ADAPT_DP,DP ;set DP now and not worry about it 
ST #K_mu, d_mu 
STM #1, INDEX_P ; increment value to be used by 
; dual address 
7 associate auxilary registers for circular computation 
STM #xh+H_FILT_SIZE-1, XH_DATA_P ; last input of hybrid buffer 
RETD 
STM #XwWt+ADPT_FILT_SIZE-1,XW_DATA_P ;last element of input buffer 
; 5. SUBROUTINE CODE 
: HeaderBegin 
, 
, 
j -------------------------------------------- +--+ +--+ - ~~ +--+ +--+ +--+ +--+ 
; 52 adapt_task 
’ 
7 5.2.1 Functional Description 
i 
; This subroutine performs the adaptive filtering.The newest sample is 
A stored in a seperate location since filtering and adaptation are performed 
: at the same time. Otherwise the oldest sample is over written before 
; up dating the w(N-1) coefficient. 
ti 
; d_primary = xh *hcoff 
, d_output = xw *wcoff 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


; LMS algorithm: 

A w(itl) = w(i)+d*mu_error*xw(n-i) for i = 0,1,...127 and n = 0,1,..... 
; This program can run in two steps 

: 1. Initial stepsize, d_mu = 0x0. At this point, the system is not 
7 identified since the coefficients are not adapted and the error 

7 signal e (n) is d (n). This is the default mode 

; 2. At the EVM debugger command window change the step size 

; d_mu —- 0x000, with the command e * d_mu = 0x1000 

; This changes the stepsize. The error signal e(n) in this case 

7 is approximately 0 (theoretically) and the system is identified. 

ar aaa a i a am a aac aan ia an ea ae dr a ea ae es a ; 
; 5.2.2 Activation 

; Activation example: 

: CALL adapt_task 

7 Reentrancy: No 

; Recursive No 

, 

7 5.2.3 Inputs 

, 

; Data structure: AR3 

; Data Format: 16-bit x(n) data buffer pointer for H(Z) 

; Modified: Yes 

7 Description: uses circular buffer addressing mode of size 128 
, 

; Data structure: AR5 

7 Data Format: 16-bit x(n) data buffer pointer for W(z) 

7 Modified: Yes 

; Description: uses circular buffer addressing mode of size 128 
, 

; Data structure: AR6 

7 Data Format: 16-bit input buffer pointer 

; Modified: Yes 

; Description: either point to PING/PONG buffer 

, 

; 5.2.4 Outputs 

v 

; Data structure: AR7 

; Data Format: 16-bit output buffer pointer 

7 Modified: Yes 

; Description: either point to PING/PONG buffer 

v 

; 5.2.5 Global 

v 

; Data structure: AR2 

7 Data Format: 16-bit H(z) coeff buffer pointer 

; Modified: Yes 

; Description: uses circular buffer addressing mode of size 128 
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Data Format: 
Modified: 
Description: 


Data structure: 


AR4 
16-bit W(z) 
Yes 
uses circular buffer addressing mode of size 128 


coeff buffer pointer 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


; Data structure: d_adapt_count 

; Data Format: 16-bit variable 

; Modified: Yes 

7 Description: counter to check for processing 256 samples 

r 

; Data structure: d_new_x 

; Data Format: 16-bit variable 

: Modified: Yes 

7 Description: holds the newest sample 

, 

7 Data structure: d_primary 

; Data Format: 16-bit variable 

; Modified: Yes 

; Description: d_primary = xh * hcoeff 

, 

: Data structure: d_output 

: Data Format: 16-bit variable 

; Modified: Yes 

A Description: d_output = xw * wecoff 

, 

} Data structure: d_error 

; Data Format: 16-bit variable 

; Modified: Yes 

A Description: d_error = d_primary-d_output 

, 

7 Data structure: d_mu_e 

; Data Format: 16-bit variable 

: Modified: Yes 

: Description: d_mu_e = mu*d_error 

r 

; 5.2.6 Special considerations for data structure 

; es 

; 5.2.7 Entry and Exit conditions 

; 

7 | DP | OVM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
7. || | | fo of fd | | | 
jin |u | DW a lwo | 1 |nu ju. [NU [NU is [nu | u |u| vu |uM|Nu/NU|NU |NU/NU | 
: | | | 

pout |U | 1 | 1 [Nu | 1 {NU lo @ | UM re le om UM |UM Pee eee ee | 
; Note UM - Used & Modified, U - Used, NU - Not Used 

r 

; 5.2.8 Execution 

; Execution time: ?cycles 

7 Call rate: not applicable for this application 

, 

, 

; HeaderEnd 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


; 5.2.9 Code 


-asg AR2,H_COFF_P , H(Z) coeff buffer pointer 
-asg AR3,XH_DATA_P ; Gata coeff buffer pointer 
-asg AR6, INBUF_P ; input buffer address pointer 
-asg AR7, OUTBUF_P , output buffer address pointer 
; for cal. primary input 
-asg AR4,W_COFF_P , W(z) coeff buffer pointer 
-asg AR5, XW_DATA_P ; Gata coeff buffer pointer 
-sect "filter” 
adapt_task: 
STM #H_FILT_SIZE, BK ; first circular buffer size 
STM #hcoff, H_COFF_P ; H_COFF_P --> last of sys coeff 
ADDM #1,d_adapt_count 
LD *INBUF_P+, A ; load the input sample 
STM #wcoff,W_COFF_P ; veset coeff buffer 
STL A, d_new_x ; read in new data 
LD d_new_x,A 4 
STL A, *XH_DATA_P+0% ; store in the buffer 
RPTZ A, #H_FILT_SIZE-1 ; Repeat 128 times 
MAC *H COFF_P+0%,*XH_DATA P+0%,A ; mult & acc:a = ad¢+t (h * x) 
STH A,d_primary ; primary signal 
7 start simultaneous filtering and updating the adaptive filter here. 
LD d_mu_e, T ; T = step_size*error 
SUB B,B ;, zero acc B 
STM # (ADPT_FILT_SIZE-2) ,BRC ; set block repeat counter 
RPTBD ims_end-1 
MPY *XW_DATA_P+0%, A 7 error * oldest sample 
LMS *W_COFF_P, *XW_DATA_P ; B = filtered output (y) 
; Update filter coeff 
ST A, *W_COFF_P+ 7 save updated filter coeff 
| | MPY*XW_DATA P+03%,A j; error *x[n-(N-1) ] 
LMS *W_COFF_P, *XW_DATA_P B = accum filtered output y 
; Update filter coeff 
lms_end 
STH A, *W_COFF_P ; final coeff 
MPY *XW_DATA_P,A ; x (0) *h (0) 
MVKD #d_new_x, *XW_DATA_P ; store the newest sample 
LMS *W_COFF_P, *XW_DATA_P+0% 
STH B, d_output ; store the fitlered output 
LD d_primary,A 
SUB d_output,A 
STL A, d_error ; store the residual error signal 
LD d_mu, T 
MPY d_error,A ; A=ur*e 
STH A,d_mu_e 7 save the error *step_size 
LD d_error,A ; residual error signal 
STL A, *OUTBUF_P+ 
LD #K_FRAME_SIZE,A ; check if a frame of samples 
SUB d_adapt_count,A ; have been processed 
BC adapt_task,AGT 
RETD 
ST #K_0,d_adapt_count ; restore the count 
end 
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Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


* This is an input file used by the adaptive filter program. 
* The transfer function is the system to be identifued by the adaptive filter 


.word OFFFDh 
.word 24h 
.word 6h 
.word OFFFDh 
.word 3h 
.word 3h 
.word OFFE9Qh 
.word 7h 
.word 12h 
.word 1Ch 
.word OFFF3h 
.word OFFE8h 
.word OCh 
.word 3h 
.word 1Eh 
.word 1Ah 
.word 22h 
.word OFFF5h 
.word OFFE5h 
.word OFFF1h 
.word OFFC5h 
.word OCh 
.word OFFE8h 
.word 37h 
.word OFFE4h 
.word OFFCAh 
.word 1Ch 
.word OFFFDh 
.word 21h 
.word OFFF7h 
.word 2Eh 
.word 28h 
.word OFFC6h 
.word 53h 
.word OFFBOh 
.word 55h 
.word OFF36h 
.word 5h 
.word OFFCFh 
.word OFF99h 
.word 64h 
.word 4lh 
.word OFFF1ih 
.word OFFDFh 
.word ODih 
.word 6Ch 
.word 57h 
.word 36h 
.word OAOh 
.word OFEE3h 
.word 6h 
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.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


OFEC5h 
OABh 
185h 
OFFF6h 
93h 
1Fh 
10Eh 
59h 
OFEFOh 
96h 
OFFBFh 
OFF47h 
OFF76h 
OFFOBh 
OFFAFh 
14Bh 
OFF3Bh 
132h 
289h 
8Dh 
OFE1Dh 
OFE1Bh 
OD4h 
OFF69h 
14Fh 
2AAh 
OFD43h 
OF98Fh 
451h 
13Ch 
OFEF7h 
OFE36h 
80h 
OFFBBh 
OFC8Eh 
10Eh 
37Dh 
6FAh 
1h 
OFD89h 
198h 
OFE4Ch 
OFE78h 
OF215h 
479h 
749h 
289h 
OF667h 
304h 
5F8h 
34Fh 
47Bh 
OFF7Fh 


PRELIMINARY 


PRELIMINARY 


PRELIMINARY 


Application Code 


Example 10-16. Adaptive Filtering Using LMS Instruction (Continued) 


.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 
.word 


85Bh 

OF837h 
OF 77Eh 
OFF80h 
OB9Bh 
OFO3Ah 


0 


EE66h 


OFE28h 
OFADOh 
8C3h 
OF5D6h 
14DCh 
OF3A7h 


0 


E542h 


10F2h 
566h 
2 6AAh 
15Ah 
2853h 


0 


ERE95h 


93Dh 
20Dh 
1230h 
238Ah 
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Example 10-17. 256-Point Real FFT Initialization 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: rfft.asm 

; Version: 10 

; Status : draft (_) 

: proposal (X) 

7 accepted ( ) dd-mm-yy/?acceptor. 
’ 

; AUTHOR Simon Lau and Nathan Baltz 

’ 

7 Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

7 Stafford, TX 77477 


; IPR statements description (can be collected). 


7(C) Copyright 1996. Texas Instruments. All rights reserved. 


; Change history: 


7 VERSION DATE / AUTHORS COMMENT 
; 1.0 July-17-96 / Simon & Nathan original created 
r 

i} 

i{ 

; 1. ABSTRACT 

r 

; 1.1 Function Type 

; a.Core Routine 

; b.Subroutine 

, 

; 1.2 Functional Description 

; This file contains core routine: 

; rfft 

, 

7 1.3 Specification/Design Reference (optional) 
’ 

; 1.4 Module Test Document Reference 

; Not done 

¥ 

; 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02 (PC) 

; Activation: asm500 -s rfft.asm 

, 

; 1.6 Notes and Special Considerations 
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Example 10-17. 256-Point Real FFT Initialization (Continued) 


H 2. VOCABULARY 
r 
A 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; _ 
: 2.3 Local Constants 
; = 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
- include “init_54x.inc” 
: 3.2 External Data 
.ref bit_rev, fft, unpack 
ref power 
.cef sine,cosine 
.ref sine_table,cos_table 
: 3.3 Import Functions 
i} 
i{ 
H 4. INTERNAL RESOURCES 
, 
7 4.1 Local Static Data 
; BS 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; - 
7 4.4 Temporary Data 
; = 
: 4.5 Export Functions 
.def rfft_task 
i} 
; 5. SUBROUTINE CODE 
; HeaderBegin 
, 
r 
joccc- BUS Se eee eee eee eae epee Penge eee oe pee eee as 
7 Sk, PEL 
, 
; 5.2 Functional Description 


The following code implements a Radix-2, DIT, 2N-point Real FFT for the 
TMS320C54x. This main program makes four function calls, each 
corresponds to a different phase of the algorithm. For more details about 
how each phase is implemented, see bit_rev.asm, fft.asm, unpack.asm, and 
power.asm assembly files. 
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Example 10-17. 256-Point Real FFT Initialization (Continued) 


5 5.3 Activation 

: Activation example: 

; CALL rfft 

7 Reentrancy: No 

; Recursive No 

; 5.4 Inputs 

; NONE 

; 5.5 Outputs 

; NONE 

, 

: 5.6 Global 

, 

A Data structure: AR1 

; Data Format: 16-bit pointer 
; Modified: No 

7 Description: used for moving the twiddle tables from 


program to data 


: 5.7 Special considerations for data structure 

5.8 a and Exit conditions 

|DP | ovM| SxM sc iia (eka bs FN) Ui I Gal ia ks |B jaan a sae 
a bs | 1 | 1 |nu | 1 [Nu |NU |NU fia [nu |Nu [NU ho | | NU eee sof 
ue |D | 1 | 1 |NU | 1 ‘0 hs lon | NU ao I i ' ia eee beg 
Note UM — Used & Modified, U - Used, NU - Not Used 

5.9 Execution 

; Execution time: ?cycles 

; Call rate: not applicable for this application 


; HeaderEnd 

; 5.10 Code 
-asg 
~sect 

rfft_task: 
STM 
RPT 
MVPD 
STM 
RPT 
MVPD 
CALL 
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AR1,FFT_TWID_P 
"vftft_prg” 


#Sine,FFT_TWID_P 

#K_FFT_SIZE-1 ; 

#Sine_table, *FFT_TWID_P+ ; 

#cosine,FFT_TWID_P 

#K_FFT_SIZE-1 ; move FIR coeffs from program 

#cos_table, *FFT_TWID_P+ ; to data 

bit_rev 

fEt 

unpack 

power 

#K_ST1,ST1 7 xvestore the original contents of 
; ST1 since ASM field has changed 
; return to main program 


move FIR coeffs from program 
to data 
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Example 10-18. Bit Reversal Routine 


; TEXAS INSTRUMENTS INCORPORATED 

; DSP Data Communication System Development / ASP ; 
r 

; Archives: PVCS 

; Filename: bit_rev.asm 

; Version: 1.0 

; Status draft 

‘ proposal (X) 

; accepted (_) dd-mm-yy/?acceptor. 
, 

; AUTHOR Simon Lau and Nathan Baltz 


r{ 
; IPR st 
7} 


Application Specific Products 

Data Communication System Development 
12203 SW Freeway, MS 701 

Stafford, TX 77477 


atements description (can be collected). 


7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


if 


; Change 
¥ 

’ 

v 

, 

i} 

i{ 

; 1. 
, 

; Link 


; 1.2 
, 
, 
r 
; 1.3 
r 
, 
; 1.4 
, 
r 
; 1D 
, 
r 
r 
r 
; Tvx'6 


history: 
VERSION DATE / AUTHORS COMMENT 
1.0 July-17-96 / Simon & Nathan original created 
ABSTRACT 
Function Type 


a.Core Routine 
b. Subroutine 


Functional Description 
This file contains one subroutine: 
bit_rev 


Specification/Design Reference (optional) 
called by rfft.asm depending upon the task thru CALA 


Module Test Document Reference 

Not done 

Compilation Information 

Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 

Activation: asm500 -s bit_rev.asm 
Notes and Special Considerations 
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Example 10-18. Bit Reversal Routine (Continued) 


; 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; = 
i} 
i{ 
; 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
; 3.2 External Data 
.ref d_input_addr, fft_data 
; 3.3 Import Functions 
i} 
i{ 
; 4. INTERNAL RESOURCES 
, 
7 4.1 Local Static Data 
; a 
; 4.2 Global Static Data 
; _ 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; = 
; 4.5 Export Functions 
.def bit_rev 
i} 
i 5. SUBROUTINE CODE 
; HeaderBegin 
; 
; 
j --------------------------------------+-+----~------~ ~~~ = ~~~ ~~~ ~~~ ~~~ ~~~ ~~~ +--+ 
; 5.1 bit_rev 
, 
; 5.2 Functional Description 
; This function is called from the main module of the ’C54x Real FFT code. 
7 It reorders the original 2N-point real input sequence by using 
; bit-reversed addressing. This new sequence is stored into the data 
; processing buffer of size 2N, where FFT will be performed in-place 
7 during Phase Two. 
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Example 10-18. Bit Reversal Routine (Continued) 


5.3 


5.4 


“0 


5.6 


Activation 

Activation example: 

CALL bit_rev 

Reentrancy: No 

Recursive No 

Inputs 

NONE 

Outputs 

NONE 

Global 

Data structure: ARO 

Data Format: 16-bit index pointer 

Modified: No 

Description: used for bit reversed addressing 

Data structure: AR2 

Data Format: 16-bit pointer 

Modified: Yes 

Description: pointer to processed data in bit-reversed order 
Data structure: AR3 

Data Format: 16-bit pointer 

Modified: Yes 

Description: pointer to original input data in natural order 
Data structure: AR7 

Data Format: 16-bit pointer 

Modified: Yes 

Description: starting addressing of data processing buffer 


5.7 Special considerations for data structure 


5.8 Entry and Exit conditions 


iil oie ig a FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN| 
| | |e ee (Cee eee ee oe ee ee ee ee 
jin ju | 1 | 1 le | 1 |nu [nu [nu [Nu [Nu |NU |NU |NU |NU |NU|NU|NU|NU |NU|NU | 
| | | } | | | | | | | | | [ | | 
jout |U 1 | 1 |Nu | 1 [nu Jum [Nu |umM [um |NU [NU |NU |UM |NU|NU|NU|UM |NU|NU | 
Note UM — Used & Modified, U - Used, NU - Not Used 
5.9 Execution 
Execution time: ?cycles 
Call rate: not applicable for this application 
;HeaderEnd 
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Example 10-18. Bit Reversal Routine (Continued) 


; 5.10 Code 
-asg AR2, REORDERED_DATA 
-asg AR3, ORIGINAL_INPUT 
-asg AR7,DATA_PROC_BUF 
-sect "CELE prog” 

bit_rev: 
SSBX FRCT ; fractional mode is on 
MVDK d_input_addr, ORIGINAL_INPUT , AR3 -> 1 st original input 
STM #££t_data, DATA_PROC_BUF ; AR7 -> data processing buffer 
MVMM DATA_PROC_BUF, REORDERED_DATA ; AR2 -> lst bit-reversed data 
STM #K_FFT_SIZE-1,BRC 
RP TBD bit_rev_end-1 
STM #K_FFT_SIZE, ARO ; ARO = 1/2 size of circ buffer 
MVDD *ORIGINAL_INPUT+, *REORDERED_DATA+ 
MVDD *ORIGINAL_INPUT-, *REORDERED_DATA+ 
MAR *ORIGINAL INPUT+0B 

bit_rev_end 
RET ; veturn to Real FFT main module 
end 
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Example 10-19. 256-Point Real FFT Routine 


; TEXAS INSTRUMENTS INCORPORATED 
; DSP Data Communication System Development / ASP 


; Archives: PVCS 

; Filename: fft.asm 

; Version: 1.0 

; Status : draft () 

; proposal (X) 

; accepted ( ) dd-mm-yy/?acceptor. 
Tv 

; AUTHOR Simon Lau and Nathan Baltz 

’ 

; Application Specific Products 

; Data Communication System Development 
; 12203 SW Freeway, MS 701 

; Stafford, TX 77477 


i { 

£ 

; IPR statements description (can be collected). 

rp 

ti 

7 (C) Copyright 1996. Texas Instruments. All rights reserved. 


i { 
Fd 
; Change history: 


H VERSION DATE / AUTHORS COMMENT 
; 1.0 July-17-96 / Simon & Nathan original created 
, 

i} 

i{ 

; 1. ABSTRACT 

3 

; 1.1 Function Type 

7 a.Core Routine 

: b.Subroutine 

F 

7 1.2 Functional Description 

. This file contains one subroutine: 

; fft 

r 

; 1.3 Specification/Design Reference (optional) 

: called by rfft.asm depending upon the task thru CALA 
Fa 

7 1.4 Module Test Document Reference 

; Not done 

i 

: 1.5 Compilation Information 

A Compiler: TMS320C54X ASSEMBLER 

; Version: 1.02. (PC) 

- Activation: asm500 -s fft.asm 

ti 

: 1.6 Notes and Special Considerations 
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Example 10-19. 256-Point Real FFT Routine (Continued) 


~ 
ase 


wee 


+2 


res: 


eal 


o2 


23 


Ss 
HeaderBegin 


VOCABULARY 


Definition of Special Words, Keywords (optional) 


Local Compiler Flags 


Local Constants 


EXTERNAL RESOURCES 


Include Files 

-mmregs 

-include "main.inc” 

External Data 

.ref fft_data, d_grps_cnt, 


d_twid_idx, d_data_idx, sine, cosine 


Import Functions 


INTERNAL RESOURCES 


Local Static Data 


Global Static Data 


Dynamic Data 


Temporary Data 
Export Functions 
.def FEE 


SUBROUTINE CODE 
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Died, EEE 


5.2 Functional Description 
PHASE TWO (LogN)-Stage Complex FFT 

This function is called from main module of the ’C54x Real FFT code. 
Here we assume that the original 2N-point real input sequence is al 
ready packed into an N-point complex sequence and stored into the 
data processing buffer in bit-reversed order (as done in Phase One). 
Now we perform an in-place, N-point complex FFT on the data proces 
sing buffer, dividing the outputs by 2 at the end of each stage to 
prevent overflow. The resulting N-point complex sequence will be un- 
packed into a 2N-point complex sequencein Phase Three & Four. 
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Example 10-19. 256-Point Real FFT Routine (Continued) 


; 5.3 Activation 

7 Activation example: 

; CALL fft 

; Reentrancy: No 

: Recursive No 

, 

; 5.4 Inputs 

; NONE 

: 5.5 Outputs 

; NONE 

, 

; 5.6 Global 

, 

; Data structure: ARO 

; Data Format: 16-bit index pointer 

; Modified: No 

; Description: index to twiddle tables 
rd 

; Data structure: AR1L 

; Data Format: 16-bit counter 

; Modified: No 

; Description: group counter 

F 

; Data structure: AR2 

; Data Format: 16-bit pointer 

; Modified: Yes 

A Description: pointer to lst butterfly data PR,PI 
ti 

- Data structure: AR3 

; Data Format: 16-bit pointer 

; Modified: Yes 

; Description: pointer to 2nd butterfly data OR,QI 
fi 

. Data structure: AR4 

: Data Format: 16-bit pointer 

; Modified: Yes 

: Description: pointer to cosine value WR 
Fa 

7 Data structure: AR5 

; Data Format: 16-bit pointer 

; Modified: Yes 

7 Description: pointer to cosine value WI 
F 3 

7 Data structure: AR6 

; Data Format: 16-bit counter 

; Modified: Yes 

: Description: butterfly counter 

Fa 

; Data structure: AR7 

H Data Format: 16-bit pointer 

; Modified: Yes 

; Description: start address of data processing buffer 
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Example 10-19. 256-Point Real FFT Routine (Continued) 


; 5.7 Special considerations for data structure 
, 
7 5.8 Entry and Exit conditions 
, 
; |pP |ovm|Sxm oe Ren eee are |r [eel |e wre |B |BK|BRc| T|TRN| 
: ; | | | | | 
;in Ju | 1 |] 1 [Nu | 1 | 0 ial ia NU ae KG bo bo bo igs ase ee | 
, 
;out|u | 1 | 1 [nu | 1 [-1 |um |um |umM |uM a i jum |uM |uUM|UM|UM|UM |NU|NU | 
, 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
, 
; 5.9 Execution 
; Execution time: ?cycles 
: Call rate: not applicable for this application 
r 
r 
: HeaderEnd 
7 5.10 Code 
-asg AR1,GROUP_COUNTER 
.asg AR2,PX 
-asg AR3,QX 
.asg AR4,WR 
-asg AR5,WI 
-asg AR6, BUTTERFLY_COUNTER 
.asg AR7, DATA_PROC_BUF ; for Stages 1 & 2 
.asg AR7, STAGE_COUNTER , for the remaining stages 
-sect VEELE_ prog” 
LEE 
; Stage 1 -- - - -- -- -- re ae Se 
STM #K_ZERO_BK, BK ; BK=0 so that *ARn+0% == *ARn+0 
LD #-1,ASM ; outputs div by 2 at each stage 
MVMM DATA_PROC_BUF, PX ; PX -> PR 
LD *PX,A ; A := PR 
STM #f£ft_data+K_DATA_IDX_1,QX 7, QX -> QR 
STM #K_FFT_SIZE/2-1,BRC 
RPTBD stagelend-1 
STM #K_DATA_IDX_1+1, ARO 
SUB *QxX,16,A,B ; B := PR-OR 
ADD *QX,16,A ; A := PRtOR 
STH A, ASM, *PX+ ; PR’ := (PR+QR) /2 
ST B, *QX+ ; QR’ := (PR-OR) /2 
| | LD *PX,A ; A o:= PI 
SUB *QX,16,A,B ; B := PI-QI 
ADD *QX,16,A ; A := PI+OQI 
STH A, ASM, *PX+0 ; PI’ := (PI+QTI)/2 
ST B, *QX+0% j; QI’ := (PI-QI)/2 
| | LD *PX,A ; A := next PR 
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Example 10-19. 256-Point Real FFT Routine (Continued) 


stagelend: 
; Stage 2 ===> ——= == = a === - 
MVMM DATA_PROC_BUF, PX PX -> PR 
STM #f£f£t_data+K_DATA_IDX_2,QX QX -> OR 
STM #K_FFT_SIZE/4-1,BRC 
LD *PX,A A = PR 
RPTBD stage2Zend-1 
STM #K_DATA_IDX_2+1, ARO 
; ist butterfly 
SUB *QxX,16,A,B B := PR-OQR 
ADD *QxX,16,A A := PR+tOQR 
STH A, ASM, *PX+ PR’ := (PR+tQR) /2 
ST B, *QX+ QR’ := (PR-QR) /2 
| | LD *PX,A A o:= PI 
SUB *Qx,16,A,B B = PI-QI 
ADD *QxX,16,A A = PI+QI 
STH A, ASM, *PX+ PI’:= (PI+QI) /2 
STH B, ASM, *QX+ QI’ := (PI-QI)/2 
; 2nd butterfly 
MAR *QX+ 
ADD *PX, *OX,A A := PRtQI 
SUB *PX, *QX-,B B = PR-QI 
STH A, ASM, *PX+ PR’ := (PR+QI)/2 
SUB *PX, *OX,A A := PI-OR 
ST B, *QX QR’ := (PR-QI)/2 
| | LD *QX+,B Bor QR 
ST A, *PX PI’:= (PI-QR) /2 
| | ADD *PX+0%,A A := PI+OR 
ST A, *QX+0% QI’ := (PI+QR)/2 
| | LD *PX,A A := PR 
stageZend: 
; Stage 3 thru Stage logN-1 -------------- ao e==2 aS =a=S a 


STM K_TWID_TBL_SIZE, BK 

ST #K_TWID_IDX_3,d_twid_idx 

STM #K_TWID_IDX_3, ARO 

STM cosine,WR 

STM #Ssine, WI 

STM K_LOGN-2-1, STAGE_COUNTER 

ST #K_ FFT _SIZE/8-1,d_grps_cnt 

STM K_FLY_COUNT_3-1, BUTTERFLY_COUNTER 

ST K_ DATA_IDX_3,d_data_idx 
stage: 

STM fft_data, Px 

LD d_data_idx, A 

ADD * (PX),A 

STLM A,QX 

MVDK d_grps_cnt, GROUP_COUNTER 
PRELIMINARY 


BK = twiddle table size always 
init index of twiddle table 
ARO = index of twiddle table 
init WR pointer 

init WI pointer 

init stage counter 

init group counter 

init butterfly counter 

init index for input data 


PX -> PR 


QX -> OR 
AR1 contains group counter 
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Example 10-19. 256-Point Real FFT Routine (Continued) 


group: 
MVMD 
RPTBD 
LD 


| |Mpy 


butterflyend: 


BUTTERF LY_COUNTER, BRC 
butterflyend-1 

*WR, T 

*QX+,A 

*WI+0%,*OX-,A 


*PX,16,A,B 
B, *PX 
*PX+,B 


B, *QX 
*QOX+,A 


*QX, *WR+0%,A 
*PX,16,A,B 
B, *OX+ 


*PX,B 
*WR,T 
B, *PX+ 


*QOX+,A 


; Update pointers for next group 


PSHM 
MVDK 
MAR 


; Update counters and indices for next stage 


MVDK 
fft_end: 
RET 
.end 
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ARO 

d_data_idx, ARO 

*PX+0 

*QX+0 

group, *GROUP_COUNTER- 
ARO 

*QOX- 


d_data_idx,A 

#1,A,B 

B, BUTTERFLY_COUNTER 
A,1,d_data_idx 
d_grps_cnt,A 

A, ASM, d_grps_cnt 
d_twid_idx,A 

A, ASM, d_twid_idx 

D stage, *STAGE_COUNTER- 
d_twid_idx, ARO 


of butterflies in each grp 
T = WR 
A := QR*WR || QX->OT 
A := QR*WR+QI*WI 
QX->OR 
B := (QR*WR+QI*WI)+PR 
PR’ :=((QR*WR+QI*WI)+PR) /2 
B := PR-(QR*WR+QI*WI) 
PX->PI 
QR’ := (PR-(QR*WR+QI*WI) ) /2 
A := QR*WI [T=WI] 
QX->QI 
A := QR*WI-QI*WR 
B  := (QR*WI-QI*WR)+PI 
QI’ :=((QR*WI-QI*WR) +PI) /2 
QX->OQR 
B := PI-(QR*WI-QI*WR) 
T := WR 
PI’ := (PI-(QR*WI-QI*WR) )/2 
PX->PR 
A := QR*WR || QX->OT 


preserve ARO 


increment PX for next group 
increment QX for next group 


restore ARO 


B = A-1 
BUTTERFLY_COUNTER = #flies-1 
double the index of data 


1/2 the offset to next group 
1/2 the index of twiddle table 
ARO = index of twiddle table 


return to Real FFT main module 
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Example 10-20. | Unpack 256-Point Real FFT Output 


TEXAS INSTRUMENTS INCORPORATED 
DSP Data Communication System Development / ASP 


Archives: PVCS 
Filename: unpack.asm 
Version: 1:0 
Status : draft () 
proposal (X) 
accepted ( ) dd-mm-yy/?acceptor. 
AUTHOR Simon Lau and Nathan Baltz 


Application Specific Products 

Data Communication System Development 

12203 SW Freeway, MS 701 

Stafford, TX 77477 
IPR statements description (can be collected). 
(C) Copyright 1996. Texas Instruments. All rights reserved. 
Change history: 


VERSION DATE / AUTHORS COMMENT 
1.0 July-17-96 / Simon & Nathan original created 


1. ABSTRACT 


1.1 Function Type 
a.Core Routine 
b.Subroutine 


1.2 Functional Description 
This file contains one subroutine: 


unpack 


1.3 Specification/Design Reference (optional) 
called by rfft.asm depending upon the task thru CALA 


1.4 Module Test Document Reference 
Not done 


1.5 Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s unpack.asm 


1.6 Notes and Special Considerations 
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Example 10-20. Unpack 256-Point Real FFT Output (Continued) 


H 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; _ 
; 2.3 Local Constants 
; = 
i} 
i{ 
i 3. EXTERNAL RESOURCES 
; 
; 3.1 Include Files 
-mmregs 
- include "main.inc” 
; 3.2 External Data 
.ref fft_data,sine, cosine 
; 3.3 Import Functions 
i} 
i{ 
4 4 INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; =e 
; 4.4 Temporary Data 
; = 
; 4.5 Export Functions 
.def unpack 
i} 
7 5. SUBROUTINE CODE 
; HeaderBegin 
r 
r 
; 5.1 unpack 
r 
; 5.2 Functional Description 
, 
; PHASE THREE & FOUR Unpacking to 2N Outputs 
; This function is called from the main module of the ’C54x Real FFT 
; code. It first computes four intermediate sequences (RP,RM, IP, IM) 
7 from the resulting complex sequence at th nd of the previous phase. 
7 Next, it uses the four intermediate sequences to form the FFT of the 
7 original 2N-point real input. Again, the outputs are divided by 2 to 
; prevent overflow 
; es Ae, = peyeeevenn ee, peyeenren esau peyeueren = 
10-98 
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Example 10-20. Unpack 256-Point Real FFT Output (Continued) 


Application Code 


7 5.3 Activation 

. Activation example: 

: CALL unpack 

, 

; Reentrancy: 

; Recursive 

r 

: 5.4 Inputs 

; NONE 

; 5.5 Outputs 

; NONE 

, 

‘ 5.6 Global 

i 

; 5.6.1 Phase Three Global 

t 

: Data structure: ARO 

; Data Format: 16-bit index pointer 

; Modified: No 

7 Description: index to twiddle tables 
3 

7 Data structure: AR2 

; Data Format: 16-bit pointer 

; Modified: Yes 

; Description: pointer to R[k], I[k], RP[k], IP[k] 
v 

; Data structure: AR3 

: Data Format: 16-bit pointer 

; Modified: Yes 

; Description: pointer to R[N-k], I[N-k], RP[N-k], IP[N-k] 
. 3 

; Data structure: AR6 

- Data Format: 16-bit pointer 

7 Modified: Yes 

; Description: pointer to RM[k], IM[k] 
€ 

; Data structure: AR7 

7 Data Format: 16-bit pointer 

7 Modified: Yes 

; Description: pointer to RM[n-k], IM[n-k] 
v 

: 5.6.2 Phase Four Global 

v 

; Data structure: ARO 

7 Data Format: 16-bit index pointer 

; Modified: No 

; Description: index to twiddle tables 
' 

. Data structure: AR2 

; Data Format: 16-bit counter 

; Modified: No 

; Description: pointer to RP[k], IP[k], AR[k], AI[k], AR[0] 
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Example 10-20. Unpack 256-Point Real FFT Output (Continued) 


,out 


Data structure: AR3 


Data Format: 16-bit pointer 
Modified: Yes 
Description: pointer to RM[k], IM[k], AR[2N-k], AI[2N-k] 


Data structure: AR4 


Data Format: 16-bit pointer 
Modified: Yes 
Description: pointer to cos(k*pi/N), AI[0] 


Data structure: AR5 


Data Format: 16-bit pointer 
Modified: Yes 
Description: pointer to sin(k*pi/N), AR[N], ATI[N] 


5.7 Special considerations for data structure 


5.8 Entry and Exit conditions 


5.8.1 Phase Three Entry and Exit Conditions 


|DP | OvM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 
| | | | | 

lo E | 1 {| of. o |nu |NU |NU |NU |NU nu |NU |NU|NU|O |NU |NU|NU 

| | | ; | | | | _ | | Jot fot 

[2 1 | o | 2 |-2 Jum [Nu |um |um [NU |NU [UM |UM |UM|UM|UM|UM |NU|NU 


; Note : UM —- Used & Modified, U - Used, NU - Not Used 


5.8.2 Phase Four Entry and Exit Conditions 


si is il Ss ASM|ARO|AR1|AR2|AR3|AR4 oie lat |B hi T| TRN 


uf|1if]1 | 0 


| 1 |-1 [Nu [Nu [NU [NU [NU [NU [NU i |Nu|NU|NU|NU |NU|NU 
| J ee 
| 1 |-1 |um |nu Jum |um | | 


um |uM |Nu |NU |Um|UM|UM|uUM |NU|NU 


; Note : UM —- Used & Modified, U - Used, NU - Not Used 


5.9 Execution 
Execution time: ?cycles 


, 
4 
a 
r 
a 
a 
a 
a 
a 
, 
, 
a 
r 
, 
a 
, 
r 
a 
, 
r 
, 
, 
a 
, 
a 
, 
r 
, 
a 
a 
, 
a 
r 
r 
, 
a 
a 
a 
, 
r 
a 
, 
r 
a 
r 
a 


Call rate: not applicable for this application 
HeaderEnd 
5.10 Code 
-sect "CE RE prog” 
unpack: 
; Compute intermediate values RP, RM, IP, IM 
-asg AR2,XP_k 
.asg AR3, XP_Nminusk 
.asg AR6,XM_k 
.asg AR7, XM_Nminusk 
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Example 10-20. Unpack 256-Point Real FFT Output (Continued) 


STM #fft_data+2,XP_k ; AR2 -> R[k] (temp RP[k]) 
STM #£f£t_datat+2*K_FFT_SIZE-2,XP_Nminusk ; AR3 -> R[N-k] (temp 
RP [N-k] ) 
STM #f£ft_data+2*K_FFT_SIZE+3,XM_Nminusk ; AR7 -—> temp RM[N-k] 
STM #££t_datat4*K_FFT_SIZE-1,XM_k ; AR6 -> temp RM[k] 
STM #-2+K_FFT_SIZE/2,BRC 
RPTBD phase3end-1 
STM #3,ARO 
ADD *XP_k,*XP_Nminusk,A ; A := R[k]+R[N-k] = 
2*RP [k] 
SUB *XP_k, *XP_Nminusk,B ;, B := R[k]-R[N-k] = 
2*RM[k] 
STH A, ASM, *XP_k+ ; store RP[k] at AR[k] 
STH A, ASM, *XP_Nminusk+ ; store RP[N-k]=RP[k] at 
AR[N-k] 
STH B, ASM, *XM_k- ; store RM[k] at AI[2N-k] 
NEG B ; B := R[N-k]-R[k] 
2*RM[N-k] 
STH B, ASM, *XM_Nminusk-— j; store RM[N-k] at AI[N+k] 
ADD *XP_k, *XP_Nminusk,A ; A := I[k]+I[N-k] = 
2* IP [k] 
SUB *XP_k, *XP_Nminusk,B 7 Bs I [k]-I[N-k] 
2*IM[k] 
STH A, ASM, *XP_k+ ; store IP[k] at AI[k] 
STH A, ASM, *XP_Nminusk-0O ; store IP[N-k]=IP[k] at 
AI [N-k] 
STH B, ASM, *XM_k- ; store IM[k] at AR[2N-k] 
NEG B ; B := I[N-k]-I[k] = 
2* IM[N-k] 
STH B, ASM, *XM_Nminusk+0 j; store IM[N-k] at AR[N+k] 
phase3end 
ST #0, *XM_k- ; RM[N/2]=0 
ST #0, *XM_k ; IM[N/2]=0 
; Compute AR[0],AI[0], AR[N], AI[N] 
.asg AR2,AX_k 
-asg AR4, IP_0O 
-asg AR5,AX_N 
STM #£f£t_data, AX_k ; AR2 -—> AR[O] (temp 
RP[0]) 
STM #£f£t_datat+1, IP_0O ; AR4 -> AI[O] (temp 
IP[0]) 
STM #£f£t_data+2*K_FFT_SIZE+1,AX_N ; AR5 -> AI([N] 
ADD *AX_k,*IP_O,A ; A := RP[O]+IP[0] 
SUB *AX_k,*IP_0,B ; B := RP[0O]-IP[0] 
STH A, ASM, *AX_k+ ; AR[O] = (RP[O]+IP[0])/2 
ST #0, *AX_k ; AI[O] = 0 
MVDD *AX k+,*AX_N- ; AI[N] = 0 
STH B, ASM, *AX_N ; AR[N] = (RP[0O]-IP[0])/2 
; Compute final output values AR[k], AI[k] 
-asg AR3,AX_2Nminusk 
-asg AR4,COS 
-asg AR5,SIN 
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Example 10-20. Unpack 256-Point Real FFT Output (Continued) 


STM #£f£t_data+4*K_FFT_SIZE-1,AX_2Nminusk ; AR3 -> AI[2N-1] 
(temp RM[1]) 
STM #cosine+K_TWID_TBL_SIZE/K_FFT_SIZE,COS ; AR4 -—> cos (k*pi/N) 
STM #sine+K_TWID_TBL_SIZE/K_FFT_SIZE, SIN ; AR5 -> sin(k*pi/N) 
STM #K_FFT_SIZE-2,BRC 
RPTBD phase4end-1 
STM #K_TWID_TBL_SIZE/K_FFT_SIZE, ARO ; index of twiddle 
tables 
LD *AX k+,16,A ; A := RP[k] | 
AR2->IP [k] 
MACR *COS, *AX_k,A ; A :=At+cos(k*pi/N) 
*TP[k 
MASR *SIN, *AX_2Nminusk-,A ; A := A-sin(k*pi/N) 
*RM[k] 
; || AR3->IM[k 
LD *AX 2Nminusk+,16,B ; B := IM[k] | 
AR3->RM[k] 
MASR *SIN+0%, *AX_k-,B ; B := B-sin(k*pi/N) 
*IP[k] 
; || AR2->RP[k 
MASR *COS+0%, *AX_2Nminusk,B ; B := B-cos(k*pi/N) 
*RM[k] 
STH A, ASM, *AX_k+ ; AR[k] = A/2 
STH B, ASM, *AX_k+ ; AI[k] = B/2 
NEG B ; B:= -B 
STH B, ASM, *AX_2Nminusk- ; AI[2N-k] = -AI[k] 
= B/2 
STH A, ASM, *AX_2Nminusk-— ; AR[2N-k] = AR 
[k] = A/2 
phase4end: 
RET ; vreturntoRealFFTmain module 
.end 
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Example 10-21. | Compute the Power Spectrum of the Complex Output of the 256-Point 


Real FFT 
Hi TEXAS INSTRUMENTS INCORPORATED 
? DSP Data Communication System Development / ASP 
, 
; Archives: PVCS 
; Filename: power.asm 
7 Version: 1.0 
; Status : draft () 
: proposal (X) 
7 accepted ( ) dd-mm-yy/?acceptor. 
, 
7 AUTHOR Simon Lau and Nathan Baltz 
, 
; Application Specific Products 
; Data Communication System Development 
. 12203 SW Freeway, MS 701 
; Stafford, TX 77477 
i { 
v 
; IPR statements description (can be collected). 
i} 
, 
; (C) Copyright 1996. Texas Instruments. All rights reserved. 
, 
i{ 
7 Change history: 
, 
H VERSION DATE / AUTHORS COMMENT 
; 1.0 July-17-96 / Simon & Nathan original created 
i} 
i{ 
H 1. ABSTRACT 
, 
; 1.1 Function Type 
7 a.Core Routine 
; b. Subroutine 
, 
; 1.2 Functional Description 
. This file contains one subroutine: 
7 power 
, 
; 1.3 Specification/Design Reference (optional) 
: called by rfft.asm depending upon the task thru CALA 
, 
7 1.4 Module Test Document Reference 
; Not done 
, 
: 1.5 Compilation Information 
A Compiler: TMS320C54X ASSEMBLER 
7 Version: 1.02 (PC) 
H Activation: asm500 -s power.asm 
, 
; 1.6 Notes and Special Considerations 
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Example 10-21. Compute the Power Spectrum of the Complex Output of the 256-Point 


Real FFT (Continued) 

i{ 
; 2. VOCABULARY 
, 
; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; = 
; 2.3 Local Constants 
; = 
i} 
i{ 
7 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 

-mmregs 

- include "main.inc” 
; 3.2 External Data 

.ref fft_data, d_output_addr 
; 3.3 Import Functions 
i} 
i{ 
; 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; = 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; Ss 
; 4.5 Export Functions 

.def power 
i} 
H 5. SUBROUTINE CODE 
; HeaderBegin 
, 
r 
y--- _ _ as _ _ - 
: 5.1 power 
, 
7 5.2 Functional Description 
7 PHASE FIVE Power Spectrum 
; This function is called from the main module of the ’C54x Real FFT 
: code. It computes the power spectrum of the Real FFT output. 
; 2s peseeaee = aes ae: Pi = penuanees = 
; 
; 5.3 Activation 
: Activation example: 
; CALL power 
; Reentrancy: No 
; Recursive : No 
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Example 10-21. Compute the Power Spectrum of the Complex Output of the 256-Point 


Real FFT (Continued) 
, 
7 5.4 Inputs 
; NONE 
; 5.5 Outputs 
; NONE 
, 
; 5.6 Global 
: Data structure: AR2 
- Data Format: 16-bit pointer 
; Modified: Yes 
; Description: pointer to AR[k], AI[k] 
, 
. Data structure: AR3 
: Data Format: 16-bit pointer 
: Modified: Yes 
: Description: pointer to output buffer 
, 
; 5.7 Special considerations for data structure 
; a 
A 5.8 Entry and Exit conditions 
, 
| DP | OVM| SXM|C16|FRCT|ASM|ARO|AR1|AR2|AR3|AR4|AR5|AR6|AR7|A |B |BK|BRC| T|TRN 


i; | | | | ; || 
jin ju | 1] 1 [nu | 1 [NU [NU [NU [NU [NU [NU [NU |NU |NU |NU| 
i 
| 
| 


|BK|BRC| T|TRN| 
| | 
NU|NU|NU |NU|NU 
pout |U | 1] 1 |nu [1 bi | NU ie as = iN le ia ee al dle | | | 


UM |NU|NU 
v 
; Note : UM - Used & Modified, U - Used, NU - Not Used 
v 
A 5.9 Execution 
; Execution time: ?cycles 
; Call rate: not applicable for this application 
, 
, 
; HeaderEnd 
; 5.10 Code 
.asg AR2,AX 
-asg AR3,OUTPUT_BUF 
-sect "pwr_prog” power: 
MVDK d_output_addr, OUTPUT_BUF ; AR3 points to output buffer 
STM #K_FFT_SIZE*2-1,BRC 
RPTBD power_end-1 
STM #£f£t_data, AX ; AR2 points to AR[0] 
SQUR *AX+,A ; A := AR‘2 
SQURA *AX+,A ; A := AR*2 + AI*2 
STH A, *OUTPUT_BUF+ 
power_end: 
RET , return to main program 
.end 
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TEXAS INSTRUMENTS INCORPORATED 
DSP Data Communication System Development / ASP 


Archives: PVCS 
Filename: fifo.asm 
Version: 10 
Status : draft (_) 
proposal (X) 
accepted ( ) dd-mm-yy/?acceptor. 
AUTHOR Padma P. Mallela 


Application Specific Products 
Data Communication System Development 
12203 SW Freeway, MS 701 
Stafford, TX 77477 
IPR statements description (can be collected). 


(C) Copyright 1996. Texas Instruments. All rights reserved. 


Change history: 


VERSION DATE / AUTHORS COMMENT 
Ib July-25-96 / P.Mallela original created 


1. ABSTRACT 


1.1 Function Type 
a.Core Routine 
b. Subroutine 


1.2 Functional Description 
This file contains one subroutines: 
fifo_host_transfer 
1.3 Specification/Design Reference (optional) 
called by main.asm depending upon if K_HOST_FLAG is set 


1.4 Module Test Document Reference 
Not done 


1.5 Compilation Information 


Compiler: TMS320C54X ASSEMBLER 
Version: 1.02 (PC) 
Activation: asm500 -s fifo.asm 


1.6 Notes and Special Considerations 
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Hi 2. VOCABULARY 
r 
A 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; _ 
; 2.3 Local Constants 
; = 
i} 
i{ 
Hi 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include "target.inc” 
; 3.2 External Data 
.ref d_command_reg 
.ref d_fifo_count 
.vef d_command_value 
.ref d_fifo_ptr 
.vref d_output_addr 
: 3.3 Import Functions 
i} 
i{ 
: 4. INTERNAL RESOURCES 
, 
5 4.1 Local Static Data 
; BS 
; 4.2 Global Static Data 
; = 
; 4.3 Dynamic Data 
; - 
7 4.4 Temporary Data 
; = 
: 4.5 Export Functions 
.def fifo_host_transfer 
i} 
ti 
; 5. SUBROUTINE CODE 
: HeaderBegin 
, 
r 
j ---------------------- ga a ae rte ee wus 
7 5.1 fifo_host_transfer 
, 
; 5.2 Functional Description 
; This routine transfers a FIFO(64) of data to host thru CH B. 
; In the process, after transferring data from DSP to FIFO sends a com- 
: mand to host thru CH A.The host acknowledges and sends a command to 
7 target (DSP) thru CH A. The host transfer can be disabled by setting 
H the K_HOST_FLAG =0 
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; 5.3 Activation 

: Activation example: 

: CALL fifo_host_transfer 

, 

; Reentrancy: No 

; Recursive : No 

r 

; 5.4 Inputs 

; Data structure: d_output_addr 

; Data Format: 16-bit variable 

i Modified: NO 

; Description: holds the starting addr of either PING/PONG addr. 

v 

A Data structure: d_fifo_count 

; Data Format: 16-bit var 

. Modified: Yes 

; Description: counter for # of transfers 

rg 

: Data structure: d_fifo_ptr 

: Data Format: 16-bit variable 

A Modified: Yes 

; Description: holds the output bffr addr. and incremented by 

; 32 for every transfer 

r 

; 5.5 Outputs 

rd 

; Data structure: AR7 

; Data Format: 16-bit output buffer pointer 

: Modified: Yes 

; Description: either point to PING/PONG buffer 

, f 

; 5.6 Global 

; Data structure: d_command_reg 

H Data Format: 16-bit variable 

; Modified: Yes 

7 Description: command from host is read thru CH A 

H Data structure: d_command_value 

; Data Format: 16-bit variable 

7 Modified: Yes 

7 Description: holds the command value 

v 

: 5.7 Special considerations for data structure 

; - 

, 

; 5.8 Entry and Exit conditions 

; aes ec sacl ba cal AR2|AR3|AR4 ‘ais aan a |? JP] PRC] se 

v 

jin ju | 1] 1 [Nu | 1 Be ee Be ee Be Re Be be i ee pales | 
Io | 1 | 1 [Nu | 1° |nu [NU [Nu |NU |NU |NU |NU |NU |U_ |UM|NU|NU|NU |NU|NU 


; Note : UM - Used & Modified, U - Used, NU - Not Used 
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; 5.9 Execution 
; Execution time: ?cycles 
7 Call rate: not applicable for this application 


; HeaderEnd 


; 5.10 Code 
i fifo_host_transfer: 
; LD #FIFO_DP,DP 
eke K_HOST_FLAG =1 
PORTR K_TRGCR_ADDR, d_command_reg ; while (port14 & BXST) 
BITF d_command_reg, K_BXST 
BC fifo_discard, TC ; FIFO discard 
MVDK d_output_addr, OUTBUF_P ; load PING/PONG bffr address 
RPT #K_FIFO_SIZE-1 ; write first set of 64 data 
3; to FIFO 
PORTW *QUTBUF_P+, K_CHB ; Fill FIFO 
ST #K_FIFO_FULL, d_command_value 
PORTW d_command_value, K_CHA ; write command to comnd reg A 
ST #1,d0_fifo_count ; start counting for tranfers 
MVKD OUTBUF_P,d_fifo_ptr ; save the fifo_ptr 
fifo_discard 
.endif 
RET 
.end 


KKK KKK KKK KKK KKK KKK KKK KK KKK KK KKK KKK KKK KKK KK KKK KKK KK KKK KKK KKK KKK KKK KKK KK KKK 


* This file includes the TCR register configuration of EVM 
KKEKKKKKKKKKKKKKKKKKKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKKKKKEK 


K_AIC_RST -set Ob << 15 ; if AICRST=0, aic is reset 
K_USR_BOT -set O000b << 12 ; User discrete output bits 
7 O, 1,2 
K_RESRV -set 0000b << 8 ; Reserved bits 
K_USR_BIN .set O0b << 6 ; User discrete input bits 0,1 
K_RCV_BRST .set O0b << 4 ; Channel B receive status regs 
; buffer half or more 
K_XMT_BXST .set lib << 2 , Ch B trasnmit status register 
; buffer half or more 
K_RCV_ARST -set Up << 1 i Ch A receive register 
K_XMT_AXST .set Ob << 1 Ch A transmit register 
K_TCR_HIGH .set K_AIC_RST|K_USR sone RESRV 
K_TCR_LOW .set K_USR_BIN|K_RCV_BRST | K_XMT_BXST|K_RCV_ARST | K_XMT_AXST 
K_TCR -set K_TCR HIGH|K TCR_LOW 


KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KKK KK 


* this includes I/O address of CH_A, CH_B and different commands that’s been 


* passed between host and the target 
KKK KK KKK KK KKK KK KKK KKK KK KKK KK KKK KKK KEK KKK KK KKK KK KKK KEK KKK KKK KK KKK KKK KKKKKKKKKE KK 


K_O -set Oh ; constant 0 

K_FIFO_FULL -set OxFF ; Full FIFO command written by 
7; target 

K_FIFO_EMPTY -set OxEE , Empty FIFO command 
; written by host 

K_AXST_CLEAR -set OxAE ; Clear AXST empty command 


; written by the target 
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K_HANDSHAKE__CMD -set OxAB ; handshake CMD written by host 

K_CHB .set 12h ; Use Channel B as I/O interface 
; to 54x EVM for sending data 

K_CHA -set 10h ; Use Channel A as I/O interface 
; to 54x EVM for send command 
; to host 

K_TRGCR_ADDR -set 14h ; Target status control register 
; I/O address location 

K_AXST -set 1h ; Oh 

K_ARST -set 2h ; used to check the control bits 

K_BXST -set oh ; check if K_FIFO_SIZE 

.set 64 ; its a 64 FIFO 
K_FRAME_ SIZE -set 256 ; Frame size 
K_HOST_FLAG -set 1 ; if O, then host interface 


; is disabled 
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7 TEXAS INSTRUMENTS INCORPORATED 

7 DSP Data Communication System Development / ASP 

, 

; Archives: PVCS 

; Filename: hst_intl.asm 

Version: 10 

; Status : draft () 

: proposal (X) 

; accepted (_ ) dd-mm-yy/?acceptor. 
, 

3 AUTHOR Padma P. Mallela 

, 

; Application Specific Products 

7 Data Communication System Development 

: 12203 SW Freeway, MS 701 

7 Stafford, TX 77477 

i { 

3 

: IPR statements description (can be collected). 

i} 

¥ 

: (C) Copyright 1996. Texas Instruments. All rights reserved. 
, 

i{ 

; Change history: 

, 

7 VERSION DATE / AUTHORS COMMENT 
; 1.0 July-25-96 / P.Mallela original created 
, 

7} 

i{ 

; 1. ABSTRACT 

, 

7 1.1 Function Type 

; a.Core Routine 

; b.Subroutine 

, 

; 1.2 Functional Description 

; This file contains interrput service routine INT1: 
‘ 1) host_command_int1l 

: 1.3 Specification/Design Reference (optional) 

A INT1 is serviced whenever host writes to CHA 
, 

; 1.4 Module Test Document Reference 

: Not done 

, 

; 1.5 Compilation Information 

; Compiler: TMS320C54X ASSEMBLER 

7 Version: 1.02 (PC) 

A Activation: asm500 -s hst_intl.asm 

, 

; 1.6 Notes and Special Considerations 
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i 2. VOCABULARY 
; 
; 2.1 Definition of Special Words, Keywords (optional) 
; = 
; 2.2 Local Compiler Flags 
; a 
A 2.3 Local Constants 
; = 
i} 
i{ 
4 3. EXTERNAL RESOURCES 
, 
; 3.1 Include Files 
-mmregs 
- include "target.inc” 
; 3.2 External Data 
; .ref FIFO_DP 
.ref d_command_reg 
.ref d_fifo_count 
.cef FIFO_DP 
.ref d_command_value 
ref d_fifo_ptr 
.ref d_output_addr 
7 3.3 Import Functions 
i} 
i{ 
: 4. INTERNAL RESOURCES 
r 
; 4.1 Local Static Data 
; _ 
; 4.2 Global Static Data 
; = 
A 4.3 Dynamic Data 
; = 
; 4.4 Temporary Data 
; = 
; 4.5 Export Functions 
.def host_command_intl 
i} 
H 5. SUBROUTINE CODE 
; HeaderBegin 
r 
, 
; aaa Sa, ite a a aes Boe a ae 
; 5.1 host_command_intl 
r 
; 5.2 Functional Description 
7 The host generates INT1 DSP whenever it writes to CH A. In INT1 
; service routine, the command from host is read whether the FIFO 
H has been empty. Writes another 32 data from target to FIFO. 
; Sends a command to host. The host acknowledges the command and read 
; the 32 data from the FIFO and sends a command to the target for 
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another set of 32 data. 
256 processed samples are transferred to host. 
done background, 


i.e., INT1 is globally enabled. 


This process continues for 6 times till all 
Processing INT1 is 


; 5.3 Activation 

: Activation example: 

: BD host_command_int1l 

; PSHM STO 

; PSHM ST1 

, 

} Reentrancy: 

; Recursive 

, 

7 5.4 Inputs 

; Data structure: d_fifo_count 

: Data Format: 16-bit var 

; Modified: Yes 

7 Description: counter for # of transfers 

' 

; Data structure: d_fifo_ptr 

A Data Format: 16-bit variable 

; Modified: Yes 

A Description: holds the output bffr addr. and incremented by 
: 32 for every transfer 

, 

; 5.5 Outputs 

’ 

} Data structure: AR7 

: Data Format: 16-bit output buffer pointer 
; Modified: Yes 

; Description: either point to PING/PONG buffer 
v 

; 5.6 Global 

7 Data structure: d_command_reg 

i. Data Format: 16-bit variable 

; Modified: Yes 

; Description: command from host is read thru CH A 
a 

; Data structure: d_command_value 

: Data Format: 16-bit variable 

7 Modified: Yes 

; Description: holds the command value 

7 5.7 Special considerations for data structure 
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; 5.8 Entry and Exit conditions 
rd 
; Pe oom ee ener) aee sil (ata Vins cin - fea ca se 
, 
sin |u | 1] 1 [nu ] 1 ios ie [NU | NU se | NU hl ” ° teas as (abe ae 
eI | | | 
;out|u | 1 | 1 |[Nu | 1) [Nu [NU ' hs | NU e |Nu |NU |U- |UM|NU|NU|NU |NU|NU 
, 
; Note : UM —- Used & Modified, U - Used, NU - Not Used 
, 
; 5.9 Execution 
7 Execution time: ?cycles 
: Call rate: not applicable for this application 
r 
r 
; HeaderEnd 
; 5.10 Code 
.asg AR7, OUTBUF_P ; output buffer pointer 
-asg AR7,SV_RSTRE_AR7 
-sect "ra ko fa 
host_command_intl: 
PSHM AL 
PSHM AH 
PSHM AG 
PSHM SV_RSTRE_AR7 ; AR7 is used as a poiner for 
; output buffer 
LD #FIFO_DP,DP ; restore the DP 
PORTR K_CHA, d_command_value ; read command from host 
wait_host_receive_data 
PORTR K_TRGCR_ADDR,d_command_reg ; while (portl14 & AXST) 
BITF d_command_reg, K_ARST ; check FIFO empty 
BC wait_host_receive_data,TC ; branch occurs 
LD #K_FIFO_EMPTY,A ; indicate of FIFO empty 
SUB d_command_value,A 
bad_command 
BC bad_command, ANEQ ; read the command send by host 
LD # (K_FRAME_SIZE/ (K_FIFO_SIZE/2))-1,A 
SUB d_fifo_count,A ; check for complete transfer of 
; 256 samples 
BC start_remain_fifo_transfer, AGT 
BD transfer_over 
ST #0,d_fifo_count ; reset the fifo count 
; start_remain_fifo_transfer 
MVDK d_fifo_ptr, OUTBUF_P ; load PING/PONG bffr address 
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RPT 

PORTW 
ST 
PORTW 


ADDM 
MVKD 
transfer_over: 
POPM 
POPM 
POPM 
POPM 
POPM 
POPM 
RETE 
.end 


PRELIMINARY 


# (K_FIFO_SIZE/2)-1 
*OUTBUF_P+,K_CHB 
#K_FIFO_FULL, d_command_value 
d_command_value, K_CHA 


#1,d_fifo_count 
OUTBUF_P,d_fifo_ptr 


SV_RSTRE_AR7 
AG 

AH 

AL 

ST1 

STO 


’ 


r 


r 


a 


write 32 data to FIFO 
Fill FIFO half 


write command to cmmnd reg A 
for FIFO half 


save the fifo_ptr 


restore AR7 


Application Code Examples 10-115 


Application Code PRELIMINARY 


Example 10-24. — Function Calls on Host Side 


Host Action 
[RRR KR KK KR KK OK KR OK KR KK OK OK RK RK RK RR OK OR RK OK OK OK RK KK KK OK / 
/* FILE NAME: HOST.C */ 


aes C54x EVM/HOST COMMUNICATION FUNCTIONS -- HOST SIDE */ 
[ORO CIC IO ICICI IO ICICI III IOI IOI ICI ICI IOI ICI ICI II ICI IO III ICE IOI II I ICA I IOI I ICA IO a a Ck / 


#include "“graphic2.c” 
#include "host.h” /* flag names, constants a 


/* 


This function initializes the data buffer and reads the FIFO so that FIFO 
is empty when the real data transfers start 


RRP 8 sa Aa pS RIOR DOTY PUN ANNO ENE OU Ca NOUNS S NE ae UD I DIETS SI TEIN ILE a ORT aR */ 
void initialize_slave (void) 
{ 
int j; 
for (j=0;j3 < 64; Jt+) 
dataa[j] = inport(BDAT_REG) ; /* read data from data reg. */ 
for (j=0; 3 <256; j++) 
dataa[j] = 0; 


outport (CONT_REG, inport(CONT_REG) & Oxf7ff); 

} 

/* 

The target sends a command to the host after collecting 32 word data from DSP 
memory to FIFO. The host checks if the command has been received 


ns a a ee */ 
int receive_command_FIFO_FULL (void) 
/* RECEIVE COMMAND FROM EVM */ 
{ 
while(! (inport (CONT_REG) & ARST)) ; /* wait for evm to send command*/ 
reply = inport (ADAT_REG) ; /* vead command into reply*/ 
while ((reply & OxFF) !=0xFF) ; vreturn(reply) 
; /* return command for process’g*/ 

} 
/* */ 
f® This function sends a command to target for a new set of data from FIFO*/ 


void send_command_new_FIFO(command unsigned int command; 


{ 


command = OxEE; 
outport (ADAT_REG, command) ; 
while (inport (CONT_REG) & AXST); 


This initiates the handshake between the target and host. The host writes 
a command to target which sets the AXST flag to 1. The INT1 is generated 
whenever host writes to CH A. On the target side, INT1 is polled and reads 
the CH A.This clears ARST on target side. A command is written to Ch A on 
target after emptying the FIFO that sets AXSt =1. Later sets XF to go low. 
On host XF is polled and then reads CH A that clears ARST to 0 and AXST to 0 
on the target side 
Ls 
int receive_clear_AXST (void) 
/* RECEIVE COMMAND FROM EVM +f 
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{ 


command = QxAB; 
outport (ADAT_REG, command) ; 
while (inport (CONT_REG) & AXST); /* write command to evm */ 
while ((inport (CONT_REG) & XF)); 
while(! (inport (CONT_REG) & ARST)); /* wait for evm to send command*/ 
reply = inport (ADAT_REG) ; /* read command into reply xf 
while ((reply & OxAE) !=0xAE); 
return (reply); 
} 

/* ~ 


/* return command for process’ g*/ 


os x/ 
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[RRR KR KK KR KK KK OK OK KK OK RK KK RK OK RR A OR OR OR KK KK OK OK KK OK / 


[* FILE NAME: MASTER.C * / 
/* C54x EVM/HOST COMMUNICATION FUNCTIONS -- HOST SIDE */ 
[RRR KR KK KR KK KK KK KK KK RK RK KR RR OR RK KK OK OK KK OK OK OK / 

#define Fl 15104 

#define F2 15360 

#define F3 15616 


#define DATA_FRAME 256 

#include <bios.h> 

#include "“view2.c” 

int get_kbhit (void) ; 

extern void send_command_new_FIFO(unsigned int); 
void main(void) 


int count=0,n,fifo_size; 
int main_done =0; 


int done = 0; 
int hit; 
initialize_slave(); 
Vass a command is written to CH A to initiate handshake */ 
command_AXST = receive_clear_AXST() 
7 while (!main_done) 
done =0; 


init_graphics () 
; while(!done) 


count =0; 
if (kbhit () ) 


hit = get_kbhit (); 
setviewport (120, 433,290,445,0); 
clearviewport (); 

switch (hit) 


case (F1): amplitude = amplitude * 2 , outtextxy(1,1,”Amplitude 
; decreased”) 
; break; 

// case (F2): if (amplitude>=2) amplitude = amplitude / 2 

, outtextxy(1,1,”Amplitude 
; increased”) 
; break; 

case (F2) 3 done =1 ; Cclosegraph () 
; break; 

case (F3): done =1 ;main_done=1 
;closegraph () 
; break; 


else 


10-118 PRELIMINARY 


PRELIMINARY 


Application Code 


Example 10-25. Main Function Call on Host Side (Continued) 


{ 


receive_command_FIFO_FULL()j; 
for (fifo_size=0; fifo_size < 64; 
dataa[fifo_sizetcount] inport (BDAT_REG) ; 
send_command_new_FIFO (command) ; 

for (count=64; count< 256; count++) 


command_FIFO 


command_FIFO 
for (fifo_size=0; 
dataa[fifo_sizet+count ] 
count count+31; 

send_command_new_FIFO (command) ; 


receive_command_FIFO_FULL(); 
fifo_size < 32; 
inport (BDAT_REG) ; 


screen (); 


closegraph(); 
get_kbhit (void) 


unsigned int key = bioskey (0); 
fflush(stdin); 
return (key) ; 


} 


[RRR KK KK KK KR KR KK KR RR OR OK OK OK OK OK 


fifo_sizett) 


/* command from target*/ 


fifo_sizet+t) 


/* read 32 word fifo*/ 


/* send command to target*/ 


TOK RR RR OR OK OR OK OK KK OK / 


hi FILE NAME: HOST.H C54x is 
/* HOST SIDE FLAGS, CONSTANTS, COMMAND NUMBERS, AND GLOBAL VARIABLES */ 
* * 
Gee ee 
/* The numbers I’ve picked for the file I/O constants, command numbers, and */ 
/* basic control constants are not important. These numbers could really oy 
/* be anything, as long as two of them are not the sam Please notice the af 
/* pattern I used for file commands. Masks and pointers CANNOT be changed. sa 
include <stdio.h> 
/* ----------------------- FILE I/O CONSTANTS AND COMMAND --------------------- */ 
#define MAX_FRAME 256 /* size of data frame to be passed */ 
/* ------------------------- BASIC CONTROL CONSTANTS --------------------------*/ 
define STOP 99 
define NO 98 
define YES 97 
tdefine READY 96 
define CLEAR 95 
define ACKNOWLEDGE 0 
/*--------- -—--- POINTERS TO DATA, COMMAND, AND CONTROL REGISTERS ------------- * / 
unsigned int ADAT_REG = (unsigned int ) (0x240 + 0x800); 
unsigned int BDAT_REG = (unsigned int ) (0x240 + 0x804); 
unsigned int CONT_REG = (unsigned int ) (0x240 + 0x808); 
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/*------------ MASKS FOR READING MESSAGE FLAGS OF CONTROL REGISTER ----------— af 
unsigned int XF = (unsigned int ) 0x0020; 

unsigned int ARST = (unsigned int ) 0x0002; 

unsigned int AXST = (unsigned int ) 0x0001; 

unsigned int BRST_MASK = (unsigned int ) 0x0200; 

unsigned int BXST_MASK = (unsigned int ) 0x0008; 

is - —--- GLOBAL VARIABLES USED BY EVM.C AND MASTER.C - my: 
FILE*file[20]; /* stores ptrs to files in files.dat*/ 

int reply; /* integer for whatever =] 

int reply1[128]; 

int data[256]; 

int index; 

unsigned int command; 

unsigned int command1; 

unsigned int command_new_data; 

unsigned int command_FIFO; 

unsigned int command_AXST; 

unsigned int command_HANDSHAKE; 


int amplitude 
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= -10; 
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[BRK KK KK KK KK A A A A A A A A A A A A A I I I / 


/* FILE NAME: GRAPHIC2.C «i 
/* GRAPHICS DRIVER INITIALIZATION ROUTINE cf 
[% */ 


[BORK KK KK KK KK KK A A A A A A A A A A A I I / 
#include <graphics.h> 
#include <stdlib.h> 
#include <stdio.h> 
#include <conio.h> 
void init_graphics (void) 
{ 
all ¢) aa ae 
int gdriver = DETECT, gmode, errorcode; int left, top, right, bottom; 
initgraph(&gdriver, &gmode, "”"”); errorcode = graphresult (); 
if (errorcode != grOk); 


{ 


printf (”Graphics error: %s\n”, grapherrormsg(errorcode) ); 
printf(”Press any key to halt.\n”); 
getch(); 


cleardevice(); 
setlinestyle(0,0,1); 
setcolor (1); 
setfillstyle(1,3); 


rectangle (1,1,getmaxx()-1,getmaxy ()-1); 
setcolor (7); 
left = 5; 


top = getmaxy()/2 + - 123; 
right = 108; 
bottom = getmaxy()/2+ 163; 
rectangle (left,top, right, bottom) ; 
line(left,top + 30,right,top + 30); 
line (left,bottom+29,right,bottom+29) ; 
rectangle(left,top - 110,right+523,top - 5); 
rectangle (right+4,bottom + 5, right + 523,bottom + 70); 
rectangle (left,botom +5,right,bottom+70) ; 
setcolor(15); 
settextstyle(0,0,1); 
[* outtextxy (left+25,top+12,”CONTROL”) ; 
outtextxy (right+16,bottom+32,”MESSAGES:”) ; 
outtextxy (left+13,bottom+13,”AIC STATUS”) ;*/ 
outtextxy (left+2,bottomt+38,”Freq: 8kHz”); 
outtextxy (left+2,bottomt+53,”Gain: 2”); 
settextstyle(0,0,3); 
outtextxy (left+10,top-72,”C54X EVM Spectrum Analyzer”); 
setlinestyle(0,0,3); 
setcolor(15); 


left = getmaxx() / 2 — 206; 
top = getmaxy() / 2 - 123; 
right = getmaxx() / 2 + 312; 
bottom = getmaxy() / 2 + 163; 


rectangle(left, top, right, bottom); 
floodfill (left+10,topt+10,15); 
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Example 10-26. Graphic Drivers Routine (Continued) 


// 
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setcolor(15); 
setlinestyle(0,0,3); 


left = getmaxx()/2 - 311; 
top = getmaxy()/2 + 57 — 40; 
right = left + 25; 


bottom = top + 25; rectangle(left,top, right,bottom) ; 
setfillstyle(1,1); floodfill (left+5,topt+5,15); 
settextstyle(0,0,1); 

outtextxy (left+5,top+10,”"F3”); 

outtextxy (left+30,top+10,”Quit”); 

setcolor(15); setlinestyle(0,0,3); 


left = getmaxx()/2 - 311; 
top = getmaxy()/2 - 23 - 40; 
right = left + 25; 


bottom = top + 25; rectangle(left,top, right, bottom) 
setfillstyle(1,1); floodfill(left+5,topt+5,15); 
settextstyle(0,0,1); outtextxy (left+6,topt+10,”F1”); 
outtextxy (left+30,top+9, Decrease”) ; 

outtextxy (left+30,top+16, ”“amplitude”) ; 
setcolor(15); 

setlinestyle(0,0,3); 


left = getmaxx()/2 - 311; 
top = getmaxy()/2 + 17 - 40; 
right = left + 25; 


bottom = top + 25; 

rectangle (left,top, right, bottom) ; 
setfillstyle(1,1); 

floodfill (left+5,topt+5,15); 

settextstyle(0,0,1); outtextxy (left+6,topt+10,”F2”) ; 
outtextxy (left+30,top+9, ”Increase”) ; 

outtextxy (left+30,top+16, ”“amplitude”) ; 
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Example 10-27. Display the Data on the Screen 


[RRR RK KR KK KR A A A A A A A A A RK / 


/* FILE NAME: VIEW2.C */ 
[es DISPLAYS DATA ON THE SCREEN af 
[* a 


[RRR RK KR KK KR KR A A A A A A A RK / 


#include "“host.c” 
void screen (void) 
{ 
int x,y; 
int i,n; 
/* setup window (viewport) to display the AIC data */ 
xX = getmaxx()/2 - 203; 
y = getmaxy()/2; 
setviewport (x, y-120,x+512, y+160,1); 
/* move CP to left side of viewport */ 
x = getx(); 
y = gety() + 143; 
moveto (x,y); 
/* make waveform by drawing lines btwn 256 data points sent by EVM */ 
setlinestyle(0,0,1); 
setcolor (14); 
for (i=0;1<256;it+) lineto(x+1+2*i,y+tdataa[i]/amplitude) ; 
/* erase waveform just drawn by re-writing it in background color */ 
moveto (x,y); 
setcolor (3); 
for (i=0;1<256;it+) lineto(x+1+2*i,y+tdataa[i]/amplitude) ; 
} 
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Example 10-28. — Linker Command File for the Application 


[KKK RR KK KR KK KK KK OK KK RK KR KK KK KR KR OK OK KK OK RK OK OK KK 


* This linker command file is assigns the memory allocation for the application 
* based on the EVM54x specifically 541. 
FORK KK KK KK RR RK RK KR RR OR OR OR RK OR OK OK OK KK / 

vectors.obj 

init_54x.obj 

init_ser.obj 

init_aic.obj 

aic_cfg.obj 

memory.obj 

main.obj 

prces_int.obj 

rcev_intl.obj 

task.obj 

fir.obj 

iir.obj 

sym_fir.obj 

adapt .obj 

echo.obj 

rfft.obj 

bit_rev.obj 

fft.obj 

unpack.obj 

power.obj 

hand_shk.obj 

hst_intl.obj 

fifo.obj 

-o main.out 

-m main.map 


MEMORY 

{ 

PAGE 0: 
PROG : origin = 0x7000, length = 0x1000 
VECS : origin = Oxff80, length = Ox7f£ 
COFF_SYM: origin = 0x1400, length = 0x40 
COFF_FIR: origin = 0x1440, length = 0x40 
AIC_TBLE: origin = 0x1480, length = 0x10 
TASK_TBL: origin = 0x1490, length = 0x10 
TASK_INT: origin = 0x14a0, length = 0x10 
IIR_COFF: origin = 0x14b0, length = 0x10 
COEFFH : origin = 0x1500, length = 0x100 
TWID_SIN: origin = 0x1000, length = 0x80 
TWID_COS: origin = 0x1200, length = 0x80 
PAGE 1: /* Data space */ 
ALL_VARS: origin = 0x0080, length = 0x0080 
DRAM : origin = 0x0100, length = 0x1300 
EXT_DAT : origin = 0x1400, length = 0xE0O0O 
REGS : origin = 0x0000, length = 0x0060 
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Example 10-28. Linker Command File for the Application (Continued) 


SECTIONS 


{ 


text 
vectors 
main_prg 
zeropad 
aic_cnfg 
ser_cnfg 
fifo_fil 
task_hnd 
handshke 
fir_prog 
iir 
filter 
rfft_prg 
aic_reg 
task_int 
task_tbl 
coff_fir 
sym_fir 
Lis cork 
coeffh 
sin_tbl 
cos_tbl 
inpt_buf 
outdata 


UNION: 


{ 


} 
} 


£ft brttr 
adpt_sct: 


* (bufferw) 
-+=80h; 
* (bufferp) 


UNION: 


{ 


} 


fir bir 
cir_bfr 
coff_iir 
bufferh 
twid_sin 


UNION: 


{ 


fir cort 
cir_bfrl 
bufferx 
twid_cos 
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vvtvUunuvuyUyUyDUS, 
ye) 
fe) 
Q 


yee) 
ome) 
QAaQ 


AIC_TBL 
TASK_IN 
TASK_TB 
COFF_FI 
COFF_SY 
IIR_COF 
COEFFH 

TWID_SI 
TWID_CO 


VVVVVVVVVVV VV VV VV VV VV VV VV 


LT 
R 
M 
EF 


13 | 


N 
s 


PAG 


DRAM, align (1024) PAG 
DRAM, align (1024) PAG 
DRAM align (1024) PAGE 


AA So 


ira) 
PrRROWOOOOCCCOCCCOCCOCCCOCOCOCOCOCCOCCCCfo 


AA w 


Application Code 


/* code */ 
/* Vector table */ 


/* This is needed for alignment of 128 words */ 


> DRAM align (256) 


> DRAM align (256) 


PAGE 1 


PAGE 1 


Application Code Examples 
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Example 10-28. Linker Command File for the Application (Continued) 


GROUP : > ALL_VARS PAGE 1 
{ 


aic_vars 
rcev_vars 
fifo_var 
tsk_vars 
fir_vars 
iir_vars 


adpt_var 
fft_vars 
} 
stack : {} > DRAM PAGE 1 


} 
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Example 10-29. | Memory Map of TMS320C541 


7TMS320C541 MEMORY MAP 


MR 

MA 0x0000, 1, Ox002A, RAM ; MMRs 

MA 0x0030, 1, 0x0003, RAM j; 

MA 0x0060, 1, O0x0020, RAM ; SCRATCH PAD 

MA 0x0080, 1, O0x1380, RAM ; INTERNAL DATA RAM 
MA 0x0080, O, Ox1380, RAM ; INTERNAL PROGRAM RAM 
MA 0x9000, 0, Ox7000, ROM ; INTERNAL ROM 

ma 0x1400, 0, Oxec0O0, ram ; external ram 

ma 0x1400, 1, Oxec0O0O, ram ; external ram 

ma 0x0000, 2, 0x15, ioport ; i/o space 

map on 


, 

;Define reset alias to set PMST for MC mode 

, 

;alias myreset, ”"”e pmst = Oxff80; reset ” 

7e pmst = OxffeO ; MP mode, OVLY, DROM off CLKOUT on 
7e hbpenbl = 0x0000 


e *0x28 = 0x2000 ; two wait states on i/o, none for memory 
e *0x29 = 0x0000 ; no bank switching necessary 
dasm pc 


echo Loaded TMS320C54x evminit.cmd 


Application Code 
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AV) oJ=Valo [DAN 


Design Considerations for 
Using XDS510 Emulator 


This appendix assists you in meeting the design requirements of the Texas 
Instruments XDS510 emulator with respect to IEEE-1149.1 designs and 
discusses the XDS510 cable (manufacturing part number 2617698-0001). 
This cable is identified by alabel on the cable pod marked JTAG 3/5Vand sup- 
ports both standard 3-V and 5-V target system power inputs. 


The term JTAG, as used in this book, refers to Tl scan-based emulation, which 
is based on the IEEE 1149.1 standard. 


For more information concerning the IEEE 1149.1 standard, contact IEEE 
Customer Service: 


Address: IEEE Customer Service 
445 Hoes Lane, PO Box 1331 
Piscataway, NJ 08855-1331 


Phone: (800) 678-IEEE in the US and Canada 
(908) 981-1393 outside the US and Canada 


FAX: (908) 981-9667 Telex: 833233 
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A.1 Designing Your Target System’s Emulator Connector (14-Pin Header) 


JTAG target devices support emulation through a dedicated emulation port. 
This port is accessed directly by the emulator and provides emulation func- 
tions that are a superset of those specified by IEEE 1149.1. To communicate 
with the emulator, your target system must have a 14-pin header (two rows of 
seven pins) with the connections that are shown in Figure A—-1. Table A—1 
describes the emulation signals. 


Although you can use other headers, the recommended unshrouded, straight 
header has these DuPont connector systems part numbers: 


_} 65610-114 
LJ 65611-1114 
Lj} 67996—-114 
Lj) 67997-114 


Figure A-1. 14-Pin Header Signals and Header Dimensions 


TMS TRST 
7 up Pinto-pin spacing, 0.100 in. (X.Y) 
PD (Vcc) no pin (key)t Pin width, 0.025-in. square post 
TDO GND Pin length, 0.235-in. nominal 
TCK_RET GND 
TCK GND 
EMUO EMU1 


t While the corresponding female position on the cable connector is plugged to prevent improper 
connection, the cable lead for pin 6 is present in the cable and is grounded, as shown in the 
schematics and wiring diagrams in this appendix. 
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Table A—1. 14-Pin Header Signal Descriptions 


Emulatort Targett 


Signal Description State State 
EMUO Emulation pin 0 | /O 
EMU1 Emulation pin 1 | /O 
GND Ground 

PD(Vcc) Presence detect. Indicates that the emulation O 


cable is connected and that the target is 
powered up. PD should be tied to Vcc in the 
target system. 


TCK Test clock. TCK is a 10.368-MHz clock oO 
source from the emulation cable pod. This 
signal can be used to drive the system test 


clock. 
TCK_RET  Testclock return. Test clock input to the emu- O 
lator. May be a buffered or unbuffered version 
of TCK. 
TDI Test data input O | 
TDO Test data output | O 
TMS Test mode select O | 
TRSTF Test reset O | 


TI = input; O = output 

+Do not use pullup resistors on TRST: it has an internal pulldown device. In a low-noise 
environment, TRST can be left floating. In a high-noise environment, an additional pulldown 
resistor may be needed. (The size of this resistor should be based on electrical current 
considerations.) 
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Bus Protocol 


A.2 Bus Protocol 


A-4 


PRELIMINARY 


The IEEE 1149.1 specification covers the requirements for the test access port 
(TAP) bus slave devices and provides certain rules, summarized as follows: 


[1 The TMS and TDI inputs are sampled on the rising edge of the TCK signal 
of the device. 


[1 The TDO output is clocked from the falling edge of the TCK signal of the 
device. 


When these devices are daisy-chained together, the TDO of one device has 
approximately a half TCK cycle setup time before the next device’s TDI signal. 
This timing scheme minimizes race conditions that would occur if both TDO 
and TDI were timed from the same TCK edge. The penalty for this timing 
scheme is a reduced TCK frequency. 


The IEEE 1149.1 specification does not provide rules for bus master (emula- 
tor) devices. Instead, it states that the device expects a bus master to provide 
bus slave compatible timings. The XDS510 provides timings that meet the bus 
slave rules. 
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A.3 Emulator Cable Pod 


Figure A—2 shows a portion of the emulator cable pod. The functional features 
of the pod are: 


(1 TDO and TCK_RET can be parallel-terminated inside the pod if required 
by the application. By default, these signals are not terminated. 


(1 TCKis driven with a 74LVT240 device. Because of the high-current drive 
(32-mA IQ /IoH), this signal can be parallel-terminated. If TCK is tied to 
TCK_RET, you can use the parallel terminator in the pod. 


(1 TMS and TDIcan be generated from the falling edge of TCK_RET, accord- 
ing to the IEEE 1149.1 bus slave device timing rules. 


1 TMS and TDI are series-terminated to reduce signal reflections. 


Lj A 10.368-MHz test clock source is provided. You can also provide your 
own test clock for greater flexibility. 


Figure A—2. Emulator Cable Pod Interface 


5V 
iwa 8 


3 
‘ JP1 


74F175 
270 2 


TDO (pin 7) 


74LVT240 
10.368 MHz 
TMS (pin 1) 


GND (pins 4,6,8,10,12) 


TDI (pin 3) 
EMUO (pin 13) 
EMU1 (pin 14) ba TCK (pin 11)t 
[Vv 
S 2702 TRST (pin 2) 


JP2 74AS1004 
TCK_RET (pin 9)T t 


PD(Vcc) (pin 5) > 
1002 


RESIN 
TL7705A 


t The emulator pod uses TCK_RET as its clock source for internal synchronization. TCK is provided as an 
optional target system test clock source. 
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A.4 Emulator Cable Pod Signal Timing 


Figure A-3 shows the signal timings for the emulator cable pod. Table A—2 
defines the timing parameters illustrated in the figure. These timing parame- 
ters are calculated from values specified in the standard data sheets for the 
emulator and cable pod and are for reference only. Texas Instruments does 
not test or guarantee these timings. 


The emulator pod uses TCK_RET as its clock source for internal synchroni- 
zation. TCK is provided as an optional target system test clock source. 


Figure A—3. Emulator Cable Pod Timings 


i 1 a 


| 
TCK_RET ye  & fF & 
| | 


Table A—2. Emulator Cable Pod Timing Parameters 


No. Parameter Description Min Max Unit 
1 tc(TCK) Cycle time, TCK_RET 35 200 ns 
2 tw(TCKH) Pulse duration, TCK_RET high 15 ns 
3 tw(TCKL) Pulse duration, TCK_RET low 15 ns 
4 ta(TMSs) Delay time, TMS or TDI valid for TCK_RET low 6 20 ns 
5 tsu(TDO) Setup time, TDO to TCK_RET high 3 ns 
6 th(TDO) Hold time, TDO from TCK_RET high 12 ns 
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A.5 Emulation Timing Calculations 


Example A-1 and Example A-2 help you calculate emulation timings in your 
system. For actual target timing parameters, see the appropriate data sheet 
for the device you are emulating. 

The examples use the following assumptions: 


tsu(TTMS) Setup time, target TMS or TDI to TCK 


high 10 ns 
ta(TTDO) Delay time, target TDO from TCK low 15 ns 
td(bufmax) Delay time, target buffer maximum 10 ns 
td(bufmin) Delay time, target buffer minimum ins 
tbufskew Skew time, target buffer between two de- 1.35 ns 


vices in the same package: 
[td(oufmax) — ta(bufmin)] x 0.15 


tTCKfactor Duty cycle, assume a 40/60% duty cycle 0.4 
clock (40%) 
Also, the examples use the following values from Table A—2 on page A-6: 
ta(TMSmax) _ Delay time, emulator TMS or TDI from 20 ns 
TCK_RET low, maximum 
tsu(TDOmin) Setup time, TDO to emulator TCK_RET 3 ns 


high, minimum 
There are two key timing paths to consider in the emulation design: 


Ly TheTCK_RET-to-TMSorTDIpath,calledtyg(tck_RET-TMS/TDI)(Propaga- 
tion delay time) 


L) The TCK_RET-to-TDO path, called tog(TcK_RET-TDO) 


In the examples, the worst-case path delay is calculated to determine the 
maximum system test clock frequency. 
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Example A-1. Key Timing for a Single-Processor System Without Buffers 


E (TMsmax) * 'su srmus)| 


t = 
pd (TCK_RET-TMS/TDI) eae 


(20 ns + 10 ns) 
0.4 
= 75 ns, or 13.8 MHz 


It (TTD0) * su tl 


t i = 
pd (TCK_RET-TDO) troktactor 


(15 ns + 3 ns) 
0.4 
= 45 ns, or 22.2 MHz 


In this case, because the TCK_RET-to-TMS/TDI path requires more time to 
complete, it is the limiting factor. 


Example A-—2. Key Timing for a Single- or Multiple-Processor System With Buffered Input 
and Output 


ts (TMSmax) + teu (TTMS) - taea| 


_ 
pd (TCK_RET-TMS/TDI) trcktactor 


(20 ns + 10 ns + 1.35 ns) 
0.4 


= 78.4 ns, or 12.7 MHz 


ty (TTDO) + tsuctDOminy + ta eines 


7 
pd (TCK_RET-TDO) tt Cktactor 


(15 ns + 3 ns + 10 ns) 
0.4 


= 70 ns, or 14.38 MHz 


In this case also, because the TCK_RET-to-TMS/TDI path requires more time 
to complete, it is the limiting factor. 
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Emulation Timing Calculations 


In a multiprocessor application, it is necessary to ensure that the EMUO and 
EMU1 lines can go from a logic low level to a logic high level in less than 10 
us, this parameter is called rise time, t,;. This can be calculated as follows: 


ty 7 5(Roullup x Ndevices x Cload_per_device) 
= 5(4.7 kQ x 16 x 15 pF) 
= 5(4.7x 103 Qx 16 x 15 =no-12 F) 
= 5(1128 x 10-9) 
= 5.64 us 
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A.6 Connections Between the Emulator and the Target System 


It is extremely important to provide high-quality signals between the emulator 
and the JTAG target system. You must supply the correct signal buffering, test 
clock inputs, and multiple processor interconnections to ensure proper emula- 
tor and target system operation. 


Signals applied to the EMUO and EMU1 pins on the JTAG target device can 
be either input or output. In general, these two pins are used as both input and 
output in multiprocessor systems to handle global run/stop operations. EMUO 
and EMU1 signals are applied only as inputs to the XDS510 emulator header. 


A.6.1 Buffering Signals 


If the distance between the emulation header and the JTAG target device is 
greater than 6 inches, the emulation signals must be buffered. If the distance 
is less than 6 inches, no buffering is necessary. Figure A-4 shows the simpler, 
no-buffering situation. 


The distance between the header and the JTAG target device must be no more 
than 6 inches. The EMUO and EMU1 signals must have pullup resistors con- 
nected to Vcc to provide a signal rise time of less than 10 us. A 4.7-kQ resistor 
is suggested for most applications. 


Figure A-4. Emulator Connections Without Signal Buffering 


A-10 


a 6 inches or less | 


Voc 


JTAG device Emulator header 
EMUO EMUO PD 
EMU1 @ EMU1 


TRST 


TRST 


TMS 
TDI 

TDO 
TCK ’ TCK 


TMS 
TDI 
TDO 


TCK_RET 


Figure A-5 shows the connections necessary for buffered transmission sig- 
nals. The distance between the emulation header and the processor is greater 
than 6 inches. Emulation signals TMS, TDI, TDO, and TCK_RET are buffered 
through the same device package. 
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Connections Between the Emulator and the Target System 


Figure A-5. Emulator Connections With Signal Buffering 
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Greater than 
6 inches 
Vcc Voc 
JTAG device Emulator header a 
EMUO EMUO PD 
EMU1 l EMU1 
TRST TAST 
Ts <J e TS 
TDI &. TDI 
TDO > TDO 
TCK TCK 
> TCK_RET Vv 
GND 


The EMUO and EMU1 signals must have pullup resistors connected to Vcc to 
provide a signal rise time of less than 10 us. A 4.7-kQ resistor is suggested for 
most applications. 


The input buffers for TMS and TDI should have pullup resistors connected to 
Vcc to hold these signals at a known value when the emulator is not con- 
nected. A resistor value of 4.7 kQ or greater is suggested. 


To have high-quality signals (especially the processor TCK and the emulator 
TCK_RET signals), you may have to employ special care when routing the 
printed wiring board trace. You also may have to use termination resistors to 
match the trace impedance. The emulator pod provides optional internal paral- 
lel terminators on the TCK_RET and TDO. TMS and TDI provide fixed series 
termination. 


Because TRST is an asynchronous signal, it should be buffered as needed to 
ensure sufficient current to all target devices. 


Design Considerations for Using XDS510 Emulator A-11 


Connections Between the Emulator and the Target System PRELIMINARY 


A.6.2 Using a Target-System Clock 


Figure A-6 shows an application with the system test clock generated in the 
target system. In this application, the emulator’s TCK signal is left uncon- 
nected. 


Figure A-6. Target-System-Generated Test Clock 


Greater than 
6 inches 
Voc 
JTAG device Emulator header 
EMUO EMUO 
EMU1 I EMU1 
TRST TRST 
TS <] rs TMS 
TDI e TDI 
TDO > TDO 
TCK NC TCK 
> TCK_RET 


System test clock 


Note: Whenthe TMS and TDI lines are buffered, pullup resistors must be used to hold the buffer 
inputs at a known level when the emulator cable is not connected. 


There are two benefits in generating the test clock in the target system: 


(J The emulator provides only a single 10.368-MHz test clock. If you allow 
the target system to generate your test clock, you can set the frequency 
to match your system requirements. 


[1 In some cases, you may have other devices in your system that require 
a test clock when the emulator is not connected. The system test clock 
also serves this purpose. 
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A.6.3 Configuring Multiple Processors 


Connections Between the Emulator and the Target System 


Figure A—7 shows a typical daisy-chained multiprocessor configuration that 
meets the minimum requirements of the IEEE 1149.1 specification. The 
emulation signals are buffered to isolate the processors from the emulator and 
provide adequate signal drive for the target system. One of the benefits of this 
interface is that you can slow down the test clock to eliminate timing problems. 
Follow these guidelines for multiprocessor support: 


L 


Ly 


Figure A—7. Multiprocessor Connections 


JTAG device 
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Emulator header 


EMUO PD 


EMU1 


TRST 


TMS 
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TDO 
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TCK_RET 


Voc 
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The processor TMS, TDI, TDO, and TCK signals must be buffered through 
the same physical device package for better control of timing skew. 


The input buffers for TMS, TDI, and TCK should have pullup resistors con- 
nected to Vcc to hold these signals at a known value when the emulator 
is not connected. A resistor value of 4.7 kQ or greater is suggested. 


Buffering EMU0 and EMU1 is optional but highly recommended to provide 
isolation. These are not critical signals and do not have to be buffered 
through the same physical package as TMS, TCK, TDI, and TDO. 


Physical Dimensions for the 14-Pin Emulator Connector PRELIMINARY 


A.7 Physical Dimensions for the 14-Pin Emulator Connector 


The JTAG emulator target cable consists of a 3-foot section of jacketed cable 
that connects to the emulator, an active cable pod, and a short section of jack- 
eted cable that connects to the target system. The overall cable length is 
approximately 3 feet 10 inches. Figure A—-8 and Figure A—9 (page A-15) show 
the physical dimensions for the target cable pod and short cable. The cable 
pod box is nonconductive plastic with four recessed metal screws. 


Figure A—8. Pod/Connector Dimensions 


2.70 in., nominal 


4.50 in., nominal 


@ 9.50 in., nominal 
@ 


0.90 in., 
nominal 


Emulator cable pod CASS Connector 


EE 
Short, jacketed cable it 


See Figure A-9 


Note: All dimensions are in inches and are nominal dimensions, unless otherwise specified. Pin-to-pin spacing on the connec- 
tor is 0.100 inches in both the X and Y planes. 
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Figure A-9. 14-Pin Connector Dimensions 
> sn 0.20 i nch, 
nominal 


Cable 


Connector, side view 


0.100 inch, 
nominal 
(pin spacing) 


Key, pin 6 


Cable nominal 


T 0.87 inch, 


0.100 inch, 
nominal 
(pin spacing) 


a4 


Connector, front view 


] 
2 rows of pins 


PRELIMINARY Design Considerations for Using XDS510 Emulator A-15 


Emulation Design Considerations PRELIMINARY 


A.8 Emulation Design Considerations 


This section describes the use and application of the scan path linker (SPL), 
which can simultaneously add all four secondary JTAG scan paths to the main 
scan path. It also describes the use of the emulation pins and the configuration 
of multiple processors. 


A.8.1_ Using Scan Path Linkers 


A-16 


You can use the Tl ACT8997 scan path linker (SPL) to divide the JTAG 
emulation scan path into smaller, logically connected groups of 4 to 16 
devices. As described in the Advanced Logic and Bus Interface Logic Data 
Book, the SPL is compatible with the JTAG emulation scanning. The SPL is 
capable of adding any combination of its four secondary scan paths into the 
main scan path. 


A system of multiple, secondary JTAG scan paths has better fault tolerance 
and isolation than a single scan path. Since an SPL has the capability of adding 
all secondary scan paths to the main scan path simultaneously, it can support 
global emulation operations, such as starting or stopping a selected group of 
processors. 


Tl emulators do not support the nesting of SPLs (for example, an SPL 
connected to the secondary scan path of another SPL). However, you can 
have multiple SPLs on the main scan path. 


Scan path selectors are not supported by this emulation system. The TI 
ACT8999 scan path selector is similar to the SPL, but it can add only one of 
its secondary scan paths at a time to the main JTAG scan path. Thus, global 
emulation operations are not assured with the scan path selector. 


You can insert an SPL on a backplane so that you can add up to four device 
boards to the system without the jumper wiring required with nonbackplane 
devices. You connect an SPL to the main JTAG scan path in the same way you 
connect any other device. Figure A-10 shows how to connect a secondary 
scan path to an SPL. 
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Figure A-10. Connecting a Secondary JTAG Scan Path to a Scan Path Linker 
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SPL 
DTCK Tp YTAGO 
TDI | DTDOO TS 
TMS | DTMSO TCK 
+ TCK | DTDIO TRST 
TRST | aepoi TDO 
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TRO DTMS1 


DTDI1 


DTDO2 
DTMS2 
DTDI2 


DTDO3 
DTMS3 
DTDI3 


The TRST signal from the main scan path drives all devices, even those on 
the secondary scan paths of the SPL. The TCK signal on each target device 
onthe secondary scan path of an SPL is driven by the SPL’s DTCK signal. The 
TMS signal on each device onthe secondary scan path is driven by the respec- 
tive DTMS signals on the SPL. 


DTDOO on the SPL is connected to the TDI signal of the first device on the sec- 
ondary scan path. DTDIO on the SPL is connected to the TDO signal of the last 
device in the secondary scan path. Within each secondary scan path, the TDI 
signal of a device is connected to the TDO signal of the device before it. If the 
SPL is on a backplane, its secondary JTAG scan paths are on add-on boards; 
if signal degradation is a problem, you may need to buffer both the TRST and 
DTCK signals. Although degradation is less likely for DTMSn signals, you may 
also need to buffer them for the same reasons. 
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A.8.2. Emulation Timing Calculations for a Scan Path Linker (SPL) 


Example A-3 and Example A-4 help you to calculate the key emulation tim- 
ings in the SPL secondary scan path of your system. For actual target timing 
parameters, see the appropriate device data sheet for your target device. 


The examples use the following assumptions: 


tsu(TTMS) Setup time, target TMS/TDI to TCK high 10 ns 
ta(TTDO) Delay time, target TDO from TCK low 15 ns 
td(bufmax) Delay time, target buffer, maximum 10 ns 
td(bufmin) Delay time, target buffer, minimum 1ns 
t(bufskew) Skew time, target buffer, between two 1.35 ns 


devices in the same package: 
[td(bufmax) — td(bufmin)] x 0.15 


t(TCKfactor) Duty cycle, TCK assume a 40/60% clock ion 


Also, the examples use the following values from the SPL data sheet: 


ta(DTMSmax) Delay time, SPL DTMS/DTDO from TCK 31 ns 
low, maximum 

tsu(DTDLmin) Setup time, DTDI to SPL TCK high, 7 ns 
minimum 

td(DTCKHmin) Delay time, SPL DTCK from TCK high, 2ns 
minimum 

tg(DTCKLmax) Delay time, SPL DTCK from TCK low, 16 ns 
maximum 


There are two key timing paths to consider in the emulation design: 


L} The TCK-to-DTMS/DTDO path, called toq(TcK-DTMS) 


LJ The TCK-to-DTDI path, called tog¢Tck-DTDI) 


A-18 PRELIMINARY 


PRELIMINARY Emulation Design Considerations 


Of the following two cases, the worst-case path delay is calculated to deter- 
mine the maximum system test clock frequency. 


Example A-3. Key Timing for a Single-Processor System Without Buffering (SPL) 


+ ty 


ty (DTMSmax) (DTCKHmin) + tou (TTMS) 


t = 
pay er D EMS) ttcktactor 


_ (31 ns + 2 ns + 10 ns) 
0.4 


= 107.5 ns, or 9.3 MHz 


ta(rtp0) t ty (oTcKLmax) +t 


su roi) 
tod (TCK-DTDI) = t 


TCKfactor 


_ (15 ns + 16 ns + 7 ns) 
0.4 


= 9.5 ns, or 10.5 MHz 


In this case, the TCK-to-DTMS/DTDL path is the limiting factor. 


Example A-—4. Key Timing for a Single- or Multiprocessor-System With Buffered Input 
and Output (SPL) 


ta(oTMSmax) + '(oTcKHmin) + tsucttms) + "(outskew) 


t : — 
perch TMs) trcktactor 


_ (381 ns + 2 ns + 10 ns + 1.35 ns) 
0.4 


= 110.9 ns, or 9.0 MHz 


It. (TTDO) * tg(DTCKLmax) * tsu(OTDLmin) + ty (outskew) 


t a = 
Pach cnet) treKtactor 


_ (15 ns + 15 ns + 7 ns + 10 ns) 
0.4 


= 120 ns, or 8.3 MHz 


In this case, the TCK-to-DTDI path is the limiting factor. 
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A.8.3_ Using Emulation Pins 
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The EMU0/1 pins of TI devices are bidirectional, 3-state output pins. When in 
an inactive state, these pins are at high impedance. When the pins are active, 
they provide one of two types of output: 


(1 Signal Event. The EMU0/1 pins can be configured via software to signal 
internal events. In this mode, driving one of these pins low can cause 
devices to signal such events. To enable this operation, the EMU0/1 pins 
function as open-collector sources. External devices such as logic analyz- 
ers can also be connected to the EMU0/1 signals in this manner. If such 
an external source is used, it must also be connected via an open-collector 
source. 


() External Count. The EMU0/1 pins can be configured via software as 
totem-pole outputs for driving an external counter. If the output of more 
than one device is configured for totem-pole operation, then these devices 
can be damaged. The emulation software detects and prevents this condi- 
tion. However, the emulation software has no control over external 
sources on the EMU0/1 signal. Therefore, all external sources must be 
inactive when any device is in the external count mode. 


Tl devices can be configured by software to halt processing if their EMUO0/1 
pins are driven low. This feature combined with the signal event output, allows 
one TI device to halt all other Tl devices on a given event for system-level de- 


bugging. 


If you route the EMU0/1 signals between multiple boards, they require special 
handling because they are more complex than normal emulation signals. 
Figure A-11 shows an example configuration that allows any processor in the 
system to stop any other processor in the system. Do not tie the EMU0/1 pins 
of more than 16 processors together in a single group without using buffers. 
Buffers provide the crisp signals that are required during a RUNB (run bench- 
mark) debugger command or when the external analysis counter feature is 
used. 
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Figure A-11. EMU0/1 Configuration to Meet Timing Requirements of Less Than 25 ns 


i re, Target board 1_ 
| | 
Pullup | 
| Open- resistor | 
| collector 4 ae EMUO/1 | 
| drivers | 
Backplane | <] Device} _—| Device | 
XCNT_ENABLE | 1 n | 
a res za 
EMU0/1-IN 
? 
oe Pullup 
resistor | | -—-— 
EMUO/1-OUT ot i Target board m | 
| | 
To emulator EMUO | | 
TCK Pullup | 
| Open- resistor | 
| collector ¢ Ee ° EMUO/1 | 
| drivers | 
| <| Device] __——| Device | 
| : ‘ | 
ee ee ae eee =| 


Notes: 1) Thelowtime on EMU0/1-IN should be at least one TCK cycle and less than 10 us. Software sets the EMU0/1-OUT 
pin to a high state. 


2) To enable the open-collector driver and pullup resistor on EMU1 to provide rise/fall times of less than 25 ns, the modifi- 
cation shown in this figure is suggested. Rise times of more than 25 ns can cause the emulator to detect false edges 
during the RUNB command or when the external counter selected from the debugger analysis menu is used. 


These seven important points apply to the circuitry shown in Figure A-11 and 
the timing shown in Figure A-12: 


LJ Open-collector drivers isolate each board. The EMUO0/1 pins are tied 
together on each board. 


Lj Atthe board edge, the EMU0/1 signals are split to provide both input and 
output connections. This is required to prevent the open-collector drivers 
from acting as latches that can be set only once. 


[1 The EMUO0/1 signals are bused down the backplane. Pullup resistors must 
be installed as required. 
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(J The bused EMU0/1 signals go into a programmable logic array device 


PAL® whose function is to generate a low pulse on the EMU0/1-IN signal 
when a low level is detected on the EMU0/1-OUT signal. This pulse must 
be longer than one TCK period to affect the devices but less than 10 us 
to avoid possible conflicts or retriggering once the emulation software 
clears the device’s pins. 


During a RUNB debugger command or other external analysis count, the 
EMU0/1 pins on the target device become totem-pole outputs. The EMU1 
pin is a ripple carry-out of the internal counter. EMU0 becomes a proces- 
sor-halted signal. During a RUNB or other external analysis count, the 
EMUO0/1-IN signal to all boards must remain in the high (disabled) state. 
You must provide some type of external input (XCNT_ENABLE) to the 
PAL® to disable the PAL® from driving EMUO/1-IN to a low state. 


If you use sources other than TI processors (such as logic analyzers) to 
drive EMU0/1, their signal lines must be isolated by open-collector drivers 
and be inactive during RUNB and other external analysis counts. 


You must connect the EMU0/1-OUT signals to the emulation header or 
directly to a test bus controller. 


Figure A—12. Suggested Timings for the EMU0 and EMU17 Signals 


EMU0/1-OUT NEN 
EMU0/1-IN NUON / 
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Figure A—13. EMU0/1 Configuration With Additional AND Gate to Meet Timing 
Requirements of Greater Than 25 ns 


| 
| 
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| 
| 


| 
| 
| 
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oO 
+< 
oO 
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oO 
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[?) 
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To emulator EMU1 


| | 
| | 
| | 
: : EMU 1 signal from other boards 
| | 
L 


Notes: 1) Thelowtime on EMU0/1-IN should be at least one TCK cycle and less than 10 us. Software will set the EMU0/1-OUT 
port to a high state. 


2) To enable the open-collector driver and pullup resistor on EMU‘1 to provide rise/fall time of greater than 25 ns, the 
modification shown in this figure is suggested. Rise times of more than 25 ns can cause the emulator to detect false 
edges during the RUNB command or when the external counter selected from the debugger analysis menu is used. 
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You do not need to have devices on one target board stop devices on another 
target board using the EMU0/1 signals (see the circuit in Figure A-14). In this 
configuration, the global-stop capability is lost. It is important not to overload 
EMUO0/1 with more than 16 devices. 


Figure A-14. EMUO0/1 Configuration Without Global Stop 


ee 4 
fi Target board 1 | 
| 
| Pullup | 
l resistor | 
EMUO/1 : 
Pullup | - - | 
n 
To emulator ry | | 
EMUO/1 l | 
a J 
oS | 
| Target boardm | 
| | 
| | 
| Pullup | 
| resistor | 
| ® EMUO/1 | 
| | 
| Device} ‘| Device | 
n 
| | 
ae eee ee ee ae ee eee | 


Note: Theopen-collector driver and pullup resistor on EMU1 must be able to provide rise/fall times of less than 25 ns. Rise times 
of more than 25 ns can cause the emulator to detect false edges during the RUNB command or when the external counter 
selected from the debugger analysis menu is used. If this condition cannot be met, then the EMU0/1 signals from the 
individual boards must be ANDed together (as shown in Figure A-14) to produce an EMU0/1 signal for the emulator. 


A.8.4 Performing Diagnostic Applications 


For systems that require built-in diagnostics, it is possible to connect the 
emulation scan path directly to a TI ACT8990 test bus controller (TBC) instead 
of the emulation header. The TBC is described in the Texas Instruments 
Advanced Logic and Bus Interface Logic Data Book. Figure A-15 shows the 
scan path connections of n devices to the TBC. 
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Figure A-15. TBC Emulation Connections for n JTAG Scan Paths 
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In the system design shown in Figure A-15, the TBC emulation signals TCKI, 
TDO, TMSO, TMS2/EVNTO, TMS3/EVNT1, TMS5/EVNT3, TCKO, and TDIO 
are used, and TMS1, TMS4/EVNT2, and TDI1 are not connected. The target 
devices’ EMU0 and EMU1 signals are connected to Vcc through pullup resis- 
tors and tied to the TBC’s TMS2/EVNTO and TMS3/EVNT1 pins, respectively. 
The TBC’s TCKI pin is connected to a clock generator. The TCK signal for the 
main JTAG scan path is driven by the TBC’s TCKO pin. 


On the TBC, the TMSO pin drives the TMS pins on each device on the main 
JTAG scan path. TDO on the TBC connects to TDI on the first device on the 
main JTAG scan path. TDIO on the TBC is connected to the TDO signal of the 
last device on the main JTAG scan path. Within the main JTAG scan path, the 
TDI signal of a device is connected to the TDO signal of the device before it. 
TRST for the devices can be generated either by inverting the TBC’s 
TMS5/EVNTS signal for software control or by logic on the board itself. 
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Development Support and Part Order Information 


This appendix provides development support information, device part num- 
bers, and support tool ordering information for the 54x. 


Each ’54x support product is described in the TMS320 DSP Development 
Support Reference Guide. In addition, more than 100 third-party developers 
offer products that support the Tl TMS320 family. For more information, refer 
to the TMS320 Third-Party Support Reference Guide. 


For information on pricing and availability, contact the nearest TI Field Sales 
Office or authorized distributor. See the list at the back of this book. 
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B.1 Development Support 


This section describes the development support provided by Texas Instru- 
ments. 


B.1.1 Development Tools 


Tl offers an extensive line of development tools for the 54x generation of 
DSPs, including tools to evaluate the performance of the processors, generate 
code, develop algorithm implementations, and fully integrate and debug soft- 
ware and hardware modules. 


Code Generation Tools 


(1 The optimizing ANSI C compiler translates ANSI C language directly into 
highly optimized assembly code. You can then assemble andlink this code 
with the Tl assembler/linker, which is shipped with the compiler. This prod- 
uct is currently available for PCs (DOS, DOS extended memory, OS/2), 
HP workstations, and SPARC workstations. See the TMS320C54x Opti- 
mizing C Compiler User’s Guide for detailed information about this tool. 


(J The assembler/linker converts source mnemonics to executable object 
code. This product is currently available for PCs (DOS, DOS extended 
memory, OS/2). The 54x assembler for HP and SPARC workstations is 
available only as part of the optimizing ‘54x compiler. See the 
TMS320C54x Assembly Language Tools User’s Guide for detailed in- 
formation about available assembly-language tools. 


System Integration and Debug Tools 


(J The simulator simulates (via software) the operation of the 54x and can 
be used in C and assembly software development. This product is current- 
ly available for PCs (DOS, Windows), HP workstations, and SPARC 
workstations. See the TMS320C54x C Source Debugger User’s Guide for 
detailed information about the debugger. 


(i The XDS510 emulator performs full-speed in-circuit emulation with the 
54x, providing access to all registers as well as to internal and external 
memory of the device. It can be used in C and assembly software develop- 
ment and has the capability to debug multiple processors. This product is 
currently available for PCs (DOS, Windows, OS/2), HP workstations, and 
SPARC workstations. This product includes the emulator board (emulator 
box, power supply, and SCSI connector cables in the HP and SPARC ver- 
sions), the ‘54x C source debugger and the JTAG cable. 


B-2 PRELIMINARY 


PRELIMINARY 


Development Support 


Because the ’C2xx, ’C3x, 'C4x, and ’C5x XDS510 emulators also come 
with the same emulator board (or box) as the 54x, you can buy the 54x C 
Source Debugger Software as a separate product called the 54x C 
Source Debugger Conversion Software. This enables you to debug ’54x 
applications with a previously purchased emulator board. The emulator 
cable that comes with the ’C3x XDS510 emulator cannot be used with the 
54x. You need the JTAG emulation conversion cable (see Section B.2) 
instead. The emulator cable that comes with the ‘C5x XDS510 emulator 
can be used with the ’54x without any restriction. See the TMS320C54x C 
Source Debugger User's Guide) for detailed information about the ’54x 
emulator. 


(7 The TMS320C54x evaluation module (EVM) is a PC/AT plug-in card that 
lets you evaluate certain characteristics of the 54x digital signal processor 
to see if it meets your application requirements. The ‘54x EVM carries a 
541 DSP on board to allow full-speed verification of 54x code. The EVM 
has 5K bytes of on-chip program/data RAM, 28K bytes of on-chip ROM, 
two serial ports, a timer, access to 64K bytes each of external program and 
data RAM, and an external analog interface for evaluation of the ’54x fami- 
ly of devices for applications. See the 7TMS320C54x Evaluation Module 
Technical Reference for detailed information about the ’54x EVM. 


B.1.2 Third-Party Support 
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The TMS320 family is supported by products and services from more than 100 
independent third-party vendors and consultants. These support products 
take various forms (both as software and hardware), from cross-assemblers, 
simulators, and DSP utility packages to logic analyzers and emulators. The 
expertise of those involved in support services ranges from speech encoding 
and vector quantization to software/hardware design and system analysis. 


To ask about third-party services, products, applications, and algorithm devel- 
opment packages, contact the third party directly. Refer to the TMS320 Thira- 
Party Support Reference Guide for addresses and phone numbers. 
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B.1.3 Technical Training Organization (TTO) TMS320 Workshops 


B.1.4 Assistance 
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*54x DSP Design Workshop. This workshop is tailored for hardware and soft- 
ware design engineers and decision-makers who will be designing and utiliz- 
ing the ’54x generation of DSP devices. Hands-on exercises throughout the 
course give participants a rapid start in developing 54x design skills. Micropro- 
cessor/assembly language experience is required. Experience with digital 
design techniques and C language programming experience is desirable. 


These topics are covered in the ‘54x workshop: 


(j ‘54x architecture/instruction set 

Lj Use of the PC-based software simulator 
Lj] Use of the 54x assembler/linker 

J C programming environment 

L1 System architecture considerations 

_} Memory and I/O interfacing 

Lj Development support 


For registration information, pricing, or to enroll, call (800)336—5236, ext. 3904. 


For assistance to TMS320 questions on device problems, development tools, 
documentation, software upgrades, and new products, you can contact TI. 
See /f You Need Assistance in Preface for information. 
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B.2 Part Order Information 


This section describes the part numbers of 54x devices, development support 
hardware, and software tools. 


B.2.1 Device and Development Support Tool Nomenclature Prefixes 
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To designate the stages in the product development cycle, Tl assigns prefixes 
to the part numbers of all TMS320 devices and support tools. Each TMS320 
device has one of three prefix designators: TMX, TMP, or TMS. Each support 
tool has one of two possible prefix designators: TMDX or TMDS. These pre- 
fixes represent evolutionary stages of product development from engineering 
prototypes (TMX/TMDxX) through fully qualified production devices and tools 
(TMS/TMDS). This development flow is defined below. 


Device Development Evolutionary Flow: 


TMX ~The partis an experimental device that is not necessarily representa- 
tive of the final device’s electrical specifications. 

TMP —Thepartis adevice from a final silicon die that conforms to the device’s 
electrical specifications but has not completed quality and reliability 
verification. 


TMS _ The partis a fully qualified production device. 
Support Tool Development Evolutionary Flow: 


TMDX The development-support product that has not yet completed Texas 
Instruments internal qualification testing. 


TMDS The development-support product is a fully qualified development 
support product. 


TMX and TMP devices and TMDX development support tools are shipped with 
the following disclaimer: 


“Developmental product is intended for internal evaluation purposes.” 


TMS devices and TMDS development support tools have been fully character- 
ized, and the quality and reliability of the device has been fully demonstrated. 
Texas Instruments standard warranty applies to these products. 


NE 
Note: 


Itis expected that prototype devices (TMX or TMP) have a greater failure rate 
than standard production devices. Texas Instruments recommends that 
these devices notbe used in any production system, because their expected 
end-use failure rate is still undefined. Only qualified production devices 


should be used. 
ee!) 
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B.2.2 Device Nomenclature 


Tl device nomenclature includes the device family name and a suffix. 
Figure B—1 provides a legend for reading the complete device name for any 
54x device family member. 


Figure B—1. TMS320C54x Device Nomenclature 


TMS320 LC 542 PGE 1 — 50 


LL 
Device Family mIrS 
TMS320 = DSP Family 
PLL Option 
1= 1,1.5,2,3 
Technology 2= 1,4,4.5,5 
C=5V +10% CMOS No number = Software-programmable PLL 
LC = 3.3 V +10% CMOS 
VC =3 V +10% CMOS 
Package Type 
PBK = 128-pin TQFP 
Device PGE= 144-pin TQFP 
Bax: 541 542 543 oy = ee pin rr 
PJ = 100-pin QFP 
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B.2.3 Development Support Tools 


Table B—1 lists the development support tools available for the ‘54x, the plat- 
form on which they run, and their part numbers. 


Table B-1._ | Development Support Tools Part Numbers 


Development Tool Platform Part Number 
Assembler/Linker PC (DOS™) TMDS324L850-02 
C Compiler/Assembler/Linker PC (DOS™, Windows™, OS/2™) TMDS324L855-02 
C Compiler/Assembler/Linker HP (HP-UX™)/SPARC™ (SunOS™) =TMDS324L555-08 
C Source Debugger Conversion Software PC (DOS™, Windows™, OS/2™) TMDS32401L0 

(XDS510™) 
C Source Debugger Conversion Software HP (HP-UX™)/SPARC™ (SunOS™) TMDS32406L0 
(XDS510WS”™) 
Evaluation Module (EVM) PC (DOS™, Windows™, OS/2™) TMDX3260051 
Simulator (C language) PC (DOS™, Windows™) TMDS324L851-02 
Simulator (C language) HP (HP-UX™) /SPARC™ (Sun OS™) = TMDS324L551-09 
XDS510 Emulatort PC (DOS™, Windows™, OS/2™) TMDS00510 
XDS510WS Emulatort HP (HP-UX™)/SPARC™ (Sun OS™) |= TMDS00510WS 
(SCSI) 
3 V/5 V PC/SPARC JTAG Emulation Cable XDS510™ / XDS510WS™ TMDS3080002 


Tt Includes XDS510 board and JTAG cable; TMDS32401L0 C-source debugger conversion software not included 
+ Includes XDS510WS box, SCSI cable, power supply, and JTAG cable; TMDS32406L0 C-source debugger conversion software 
not included 
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Glossary 


A: See accumulator A. 


accumulator: A register that stores the results of an operation and provides 
an input for subsequent arithmetic logic unit (ALU) operations. 


accumulator A: 40-bit register that stores the result of an operation and 
provides an input for subsequent arithmetic logic unit (ALU) operations. 


accumulator B: 40-bit registers that stores the result of an operation and 
provides an input for subsequent arithmetic logic unit (ALU) operations. 


adder: A unit that adds or subtracts two numbers. 
address: The location of a word in memory. 


addressbus: Agroupofconnections used to route addresses. The 54x has 
four 16-bit address busses: CAB, DAB, EAB, and PAB. 


addressing mode: The method by which an instruction calculates the location 
of an object in memory. 


address visibility mode bit (AVIS): A bit in processor mode status register 
(PMST) that determines whether or not the internal program address 
appears on the device’s external address bus pins. 


ALU: arithmetic logic unit. The part of the CPU that performs arithmetic and 
logic operations. 


analog-to-digital (A/D) converter: Circuitry that translates an analog sig- 
nal to a digital signal. 


ARO-AR7: auxiliary registers O—7. Eight 16-bit registers that can be 
accessed by the CPU and modified by the auxiliary register arithmetic 
units (ARAUs) and are used primarily for data memory addressing. 


ARAU: See auxiliary register arithmetic unit. 
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ARP: See auxiliary register pointer. 
ASM: See accumulator shift mode field. 


auxiliary register arithmetic unit: An unsigned, 16-bit arithmetic logic unit 
(ALU) used to calculate indirect addresses using auxiliary registers. 


auxiliary register file: The area in data memory containing the eight 16-bit 
auxiliary registers. See also auxiliary registers. 


auxiliary register pointer (ARP): A 3-bit field in status register 0 (STO) used 
as a pointer to the currently-selected auxiliary register, when the device 
is operating in ’C5x/’C2xx compatibility mode. 


auxiliary registers: Eight 16-bit registers (AR7 — ARO) that are used as 
pointers to an address within data space. These registers are operated on 
by the auxiliary register arithmetic units (ARAUs) and are selected by the 
auxiliary register pointer (ARP). See also auxiliary register arithmetic unit. 


AVIS: See address visibility mode bit. 


B: See accumulator B. 


bank-switching control register (BSCR): A 16-bit register that defines the 
external memory bank size and enables or disables automatic insertion 
of extra cycles when accesses cross memory bank boundaries. 


barrel shifter: A unit that rotates bits in a word. 
BK: See circular buffer size register. 


block-repeat active flag (BRAF): A bit in status register 1 (ST1) that indi- 
cates whether or not a block repeat is currently active. 


block-repeat counter (BRC): A 16-bit register that specifies the number of 
times a block of code is to be repeated when a block repeat is performed. 


block-repeat end address register (REA): A 16-bit memory-mapped reg- 
ister containing the end address of a code segment being repeated. 


block-repeat start address register (RSA): A 16-bit memory-mapped reg- 
ister containing the start address of a code segment being repeated. 


boot: The process of loading a program into program memory. 


bootloader: A built-in segment of code that transfers code from an external 
source to program memory at power-up. 
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BRC: See block-repeat counter. 
BSCR: See bank-switching control register. 


BSP: buffered serial port. An enhanced synchronous serial port that includes 
an autobuffering unit (ABU) that reduces CPU overhead in performing 
serial operations. 


BSPCE: BSP control extension register. A 16-bit memory-mapped register 
that contains status and control bits for the buffered serial port (BSP) in- 
terface. The 10 LSBs of the SPCE are dedicated to serial port interface 
control, whereas the 6 MSBs are used for autobuffering unit (ABU) con- 
trol. 


butterfly: A kernel function for computing an N-point fast Fourier transform 
(FFT), where N is a power of 2. The combinational pattern of inputs 
resembles butterfly wings. 


C: See carry bit. 


C16: A bitin status register 1 (ST1) that determines whether the ALU oper- 
ates in dual 16-bit mode or in double-precision mode. 


carry bit: A bitin status register 0 (STO) used by the ALU in extended arith- 
metic operations and accumulator shifts and rotates. The carry bit can 
be tested by conditional instructions. 


circular buffer size register (BK): A 16-bit register used by the auxiliary reg- 
ister arithmetic units (ARAUs) to specify the data-block size in circular ad- 
dressing. 


code: Asetof instructions written to perform a task; a computer program or 
part of a program. 


cold boot: The process of loading a program into program memory at 
power-up. 


compare, select, and store unit(CSSU): Anapplication-specific hardware 
unit dedicated to add/compare/select operations of the Viterbi operator. 


CSSU: See compare, select, and store unit 
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DAGEN: See data-address generation logic (DAGEN). 


DARAM: dual-access RAM. Memory that can be read from and written to in 
the same clock cycle. 


data address bus: A group of connections used to route data memory 
addresses. The ’54x has three 16-bit buses that carry data memory 
addresses: CAB, DAB, and EAB. 


data-address generation logic (DAGEN): Logic circuitry that generates 
the addresses for data memory reads and writes. See also program-ad- 
dress generation logic (PAGEN). 


data bus: A group of connections used to route data. The ’54x has three 
16-bit data buses: CB, DB, and EB. 


data memory: A memory region used for storing and manipulating data. 
Addresses 00h—1Fh of data memory contain CPU registers. Addresses 
20h—5Fh of data memory contain peripheral registers. 


data page pointer (DP): A 9-bit field in status register 0 (STO) that specifies 
which of 512, 128 x 16 word pages is currently selected for direct address 
generation. DP provides the nine MSBs of the data-memory address; the 
dma provides the lower seven. See also dma. 


data ROM bit (DROM): A bitin PMST that determines whether or not part 
of the on-chip ROM is mapped into data space. 


digital-to-analog (D/A) converter: Circuitry that translates a digital signal 
to an analog signal. 


direct data-memory address bus: A 16-bit bus that carries the direct ad- 
dress for data memory. 


direct memory address (dma, DMA) :_ The seven LSBs of a direct-ad- 
dressed instruction that are concatenated with the data page pointer 
(DP) to generate the entire data memory address. See also data page 
pointer. 


dma: See direct memory address. 
DP: See data page pointer. 


DRB: direct data-memory address bus. A 16-bit bus that carries the direct 
address for data memory. 


DROM: See data ROM bit. 
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exponent encoder (EXP): An application-specific hardware device that 
computes the exponent value of the accumulator. 


external interrupt: A hardware interrupt triggered by a pin (INTO-INT3). 


fast Fourier transform (FFT): Anefficient method of computing the discrete 
Fourier transform, which transforms functions between the time domain 
and frequency domain. The time-to-frequency domain is called the for- 
ward transform, and the frequency-to-time domain is called the inverse 
transformation. See also butterfly. 


fastreturnregister(RTN): A 16-bit register used to hold the return address 
for the fast return from interrupt (RETF[D)]) instruction. 


general-purpose input/output pins: Pins that can be used to supply input 
signals from an external device or output signals to an external device. 
These pins are not linked to specific uses; rather, they provide input or 
output signals for a variety of purposes. These pins include the general- 
purpose BIO input pin and XF output pin. 


hardware interrupt: An interrupt triggered through physical connections 
with on-chip peripherals or external devices. 


host port interface (HPI): An 8-bit parallel interface that the CPU uses to 
communicate with a host processor. 


HPI conirol register (HPIC): A 16-bit register that contains status and con- 
trol bits for the host port interface (HPI). 


IFR: See interrupt flag register. 


IMR: See interrupt mask register. 
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interrupt: A condition caused either by an event external to the CPU or by 
a previously executed instruction that forces the current program to be 
suspended and causes the processor to execute an interrupt service 
routine corresponding to the interrupt. 


interrupt flag register (IFR): A 16-bit memory-mapped register that flags 
pending interrupts. 


interrupt mask register (IMR): A 16-bit memory-mapped register that 
masks external and internal interrupts. 


interrupt mode bit (INTM): A bit in status register 1 (ST1) that globally 
masks or enables all interrupts. 


interrupt service routine (ISR): A module of code that is executed in 
response to a hardware or software interrupt. 


IPTR: interrupt vector pointer. A 9-bit field in the processor mode status regis- 
ter (PMST) that points to the 128-word page where interrupt vectors 
reside. 


IR: instruction register. A 16-bit register used to hold a fetched instruction. 


latency: The delay between when a condition occurs and when the device 
reacts to the condition. Also, in a pipeline, the necessary delay between 
the execution of two instructions to ensure that the values used by the 
second instruction are correct. 


LSB: /east significant bit. The lowest order bit in a word. 


maskable interrupts: A hardware interrupt that can be enabled or disabled 
through software. 


memory map: A map of the addressable memory space accessed by the 
’54x processor partitioned according to functionality (memory, registers, 
etc.). 


memory-mapped registers: The '54x processor registers mapped into 
page 0 of the data memory space. 


microcomputer mode: A mode in which the on-chip ROM is enabled and 
addressable for program accesses. 
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microprocessor/microcomputer (MP/MC) bit: A bit in the processor 
mode status register (PMST) that indicates whether the processor is op- 
erating in microprocessor or microcomputer mode. See also microcom- 
puter mode; microprocessor mode. 


microprocessor mode: A mode in which the on-chip ROM is disabled for 
program accesses. 


micro stack: A stack that provides temporary storage for the address of the 
next instruction to be fetched when the program address generation logic 
is used to generate sequential addresses in data space. 


MSB: most significant bit. The highest order bit in a word. 


multiplier: A 17-bit x 17-bit multiplier that generates a 32-bit product. The 
multiplier executes multiple operations in a single cycle and operates us- 
ing either signed or unsigned 2s-complement arithmetic. 


nested interrupt: A higher-priority interrupt that must be serviced before 
completion of the current interrupt service routine (ISR). An executing 
ISR can set the interrupt mask register (IMR) bits to prevent being sus- 
pended by another interrupt. 


nonmaskable interrupt: An interrupt that can be neither masked by the 
interrupt mask register (IMR) nor disabled by the INTM bit of status regis- 
ter 1 (ST1). 


overflow: A condition in which the result of an arithmetic operation exceeds 
the capacity of the register used to hold that result. 


overflow flag: A flag that indicates whether or not an arithmetic operation 
has exceeded the capacity of the corresponding register. 
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PAGEN: See program-address generation logic (PAGEN). 


PAR: program address register. A 16-bit register used to address the pro- 
gram-memory operands in FIRS, MACD, MACP, MVDP, MVPD, READA, 
and WRITA instructions. 


PC: program counter. A 16-bit register that indicates the location of the next 
instruction to be executed. 


pipeline: A method of executing instructions in an assembly-line fashion. 


pma: program memory address. A register that provides the address of a 
multiplier operand that is contained in program memory. 


PMST: processor mode status register. A 16-bit status register that controls 
the memory configuration of the device. See also STO, ST7. 


pop: Action of removing a word from a stack. 


program-address generation logic (PAGEN): Logic circuitry that gener- 
ates the address for program memory reads and writes, and the address 
for data memory in instructions that require two data operands. This cir- 
cuitry can generate one address per machine. See also data-address 
generation logic (DAGEN). 


program counter (PC): A 16-bit register that indicates the location of the 
next instruction to be executed. 


program controller: Logic circuitry that decodes instructions, manages the 
pipeline, stores status of operations, and decodes conditional operations. 


program memory: Amemory region used for storing and executing programs. 


push: Action of placing a word onto a stack. 


RC: repeat counter. A 16-bit register used to specify the number of times a 
single instruction is executed. 


register: A group of bits used for temporarily holding data or for controlling 
or specifying the status of a device. 


repeat counter (RC): A 16-bit register used to specify the number of times 
a single instruction is executed. 
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reset: Ameans of bringing the CPU to a known state by setting the registers 
and control bits to predetermined values and signaling execution to start 
at a specified address. 


REA: _block-repeat end address. A 16-bit register that specifies the end ad- 
dress of a code segment to be repeated in repeat mode. 


RSA: block-repeat start address. A 16-bit register that specifies the start 
address of a code segment to be repeated in repeat mode. 


RTN: fast return register. A 16-bit register used to hold the return address for 
the fast return from interrupt (RETF[D]) instruction. 


SARAM: §single-access RAM. Memory thatcan be read written once during 
one clock cycle. 


serial port interface: An on-chip full-duplex serial port interface that pro- 
vides direct serial communication to serial devices with a minimum of ex- 
ternal hardware, such as codecs and serial analog-to-digital (A/D) and 
digital-to-analog (D/A) converters. Status and control of the serial port is 
specified in the serial port control register (SPC). 


shifter: A hardware unit that shifts bits in a word to the left or to the right. 


sign-control logic: Circuitry used to extend data bits (signed/unsigned) to 
match the input data format of the multiplier, ALU, and shifter. 


sign extension: An operation that fills the high order bits of a number with 
the sign bit. 


software interrupt (SINT): Aninterrupt caused by the execution of an INTR 
or TRAP instruction. 


software wait-state register (SWWSR): software wait-state register. A 
16-bit register that selects the number of wait states for the program, 
data, and I/O spaces of off-chip memory. 


SP: stack pointer. A register that always points to the last element pushed 
onto the stack. 


STO: A 16-bit register that contains ‘54x status and control bits. See also 
PMST; ST1. 


ST1: A16-bit register that contains ’54x status and control bits. See also 
PMST, STO. 
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stack: A block of memory used for storing return addresses for subroutines 
and interrupt service routines and for storing data. 


stack pointer (SP): Aregister that always points to the last element pushed 
onto the stack. 


temporary register (T): A 16-bit register that holds one of the operands for 
multiply operations, the dynamic shift count for the LACT, ADDT, and 
SUBT instructions, or the dynamic bit position for the BITT instruction. 


time-division multiplexing (TDM): The process by which a single serial 
bus is shared by up to eight 54x devices with each device taking turns 
to communicate on the bus. There are a total of eight time slots (chan- 
nels) available. During a time slot, a given device may talk to any com- 
bination of devices on the bus. 


transition register (TRN): A 16-bit register that holds the transition decision 
for the path to new metrics to perform the Viterbi algorithm. 


TSPC: TDM serial port control register. A 16-bit memory-mapped register 
that contains status and control bits for the TDM serial port. 


wait state: A period of time that the CPU must wait for external program, 
data, or I/O memory to respond when reading from or writing to that ex- 
ternal memory. The CPU waits one extra cycle (one CLKOUT1 cycle) for 
every wait state. 


warm boot: The process by which the processor transfers control to the 
entry address of a previously-loaded program. 


XF: XF status flag. A bit in status register ST1 that indicates the status of the 
XF pin. 


XPC: program counter extension. A register that contains the upper 7 bits 
of the current program memory address. 


zero fill: Amethod offilling the low or high order bits with zeros when loading 
a 16-bit number into a 32-bit field. 
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’‘AC01 initialization, example |10-38}to| 10-41 
‘ACO1 register configuration, exam- 
ple |10-42|to}10-45 


14-pin connector, dimensions |A-15 
14-pin header 
header signals |A-2 
JTAG ial 
16-bit/8-bit parallel boot 
256-point real FFT initialization, exam- 
ple [10-84]to[10-86] 


256-point real FFT routine, exam- 


ple |10-91|to|10-96 


A/D converter, definition 
accumulator, definition 
accumulator A, definition 
accumulator B, definition 
adaptive filtering using LMS instruction, exam- 
ple |10-74}to|10-83 
add two floating-point numbers, exam- 
ple |6-25]to|6-31] 
adder, definition 
addition 
address, definition 
address visibility, definition 
addressing mode, definition 
analog-to-digital converter, definition 
applications 
adaptive filtering, implementing adaptive FIR fil- 
> [a-10] g, Imp g p 


channel decoding 


branch metric equation 
codebook search 
convolutional encoding 7-4 
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pointer and storage scheme P-4 
trellis diagram 7-5 
using the Viterbi algorithm P-4 
speech coder {7-2 
CELP-based 7-2 
code vector localization (equation) 
code-excited linear predictive (CELP) 
linear predictive coding (LPC) synthesis 
using codebook search FA 
ARO-AR7, definition 
ARAU. See auxiliary register arithmetic unit 
ARAUs, definition 
arithmetic logic unit (ALU), definition 
ARP. See auxiliary register pointer 
ASM. See accumulator shift mode field 
assistance 
auxiliary register file, definition 
auxiliary register pointer, definition 
auxiliary registers, definition 
AVIS 
See also address visibility mode bit 
definition 


B. See accumulator B 

bank-switching control register, definition 

barrel shifter |C-2 

BIO pin 

bit reversal routine, example [10-87]to[10-90] 

BK. See circular buffer size register 

block diagrams 
external interfaces, 541 [1-5] 
implementing an adaptive FIR filter 4-12 
pointer management storage scheme 
speech coder [7-2| 

block repeat active flag, definition 
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block repeat active flag (BRAF) bit 
block repeat counter, definition 
block repeat end address (REA), definition 
block repeat start address (RSA), definition 
block repeat start address register, definition 
boot, definition 
boot loader, definition 
boot mode selection [8-2] 

process |8-3 
BRC. See block repeat counter 
BRE bit [8-29 
BSP serial port control extension register 


CLKP bit 
diagram 


HALTX bi 
PCM bit 
RH bit 

XH bit [8-29 


buffered serial port (BSP), definition 
buffered signals, JTAG 

buffering 

bus devices 

bus protocol |A-4 

butterfly, definition 

BXE bit 


C bit 

C compiler |B-2 

C16, definition 

cable, target system to emulator [A-1]to[A-25] 
cable pod 

carry bit, definition 

CELP-based speech coding 
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central processing unit (CPU), memory-mapped reg- 
isters 

circular buffer size register, definition 

CLDV bits 

CLKP bit 

code, definition 

code generation tools |B-2 

codebook search 
example 

cold boot, definition 

compare, select and store unit (CSSU), defini- 
tion 

compiler [B-2 


compute power spectrum of complex 256-point real 
FFT output, example |10-103)to}10-105 
configuration, multiprocessor |A-13 


connector 
14-pin header [A-2] 


dimensions, mechanical 


DuPont 
convolutional encoding, trellis diagram to|7-7 


data address bus, definition 
data bus, definition 
data memory, definition 
data page pointer, definition 
data page pointer (DP), definition 
data ROM bit, definition 
data ROM bit (DROM), definition 
data transfer — host action, example |9-13|tq 9-14 
data transfer — target action, example 
data transfer from FIFO, exam- 

ple [10-106]to[10-110] 
data-address generation logic, definition 
debug tools 
debugger. See emulation 
development tools |B-2 
device nomenclature 

diagram |B-6 

prefixes 
diagnostic applications [A-24] 
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dimensions 
12-pin header |A-20 
14-pin header |A-14 
mechanical, 14-pin header |A-14 
direct data-memory address bus, definition 
direct data-memory address bus (DRB), defini- 
tion 
direct memory address, definition 
display data on screen, example 
divide a floating-point number by another, exam- 
ple |6-37|to|6-42 
division and modulus algorithm 
dual-access RAM (DARAM), definition 
DuPont connector |A-2 


echo the input signal, example |10-56}to|10-58 


EMUO/1 
configuration 
emulation pins |A-20 
IN signals |A-20] 


rising edge modification 
EMU0/1 signals 3 ed Bed DS 


emulation 
JTAG cable 
timing calculations [A-7|to A-9][A-18]to[A-24 
emulator 
connection to target system, a TAG mechanical 
dimensions [A-14ltolA-25 
designing the JTAG cable Ae 
emulation pins 
pod interface 
signal buffering |A-10|to|A-13 
target cable, header design tolA-3] 
emulator pod, timings |A-6 
exponent encoder, definition 
extended-precision arithmetic |6-17|to 6-23] 
addition/subtraction 
32-bit addition [6-18 


64-bit subtraction 

multiplication 
32-bit fractional multiplication __\6-23 
32-bit integer multiplication (6-23 
32-bit multiplication 
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external interface, 541 {1-5 


fast Fourier transform (FFT) |C-5 

fast return register, definition 

fast return register (RTN), definition 
FE bit 

FIG bit 

floating-point arithmetic |6-24 

FSP bit 

function calls on host side, exam- 


ple [10-146ho[ 0-117 


generation of cosine wave, example [6-12|to 
generation of sine wave, example 
graphic drivers routine, example 


HALTR bit [8-29 
HALTX bit 
handshake — host action, example to 
handshake — target action, example [9-8)to|9-9| 
handshake between host and target, exam- 

ple [10-25]to 
header 


14-pin 
dimensions 14-pin |A-2 
host control register (HCR) 
bit summar 
diagram 
host port interface, definition 
host port interface boot loading sequence 
HPI control register (HPIC) |C-5 


/O boot |8-8 


IEEE 1149.1 specification, bus slave device 
rules 


initialization of serial port 1, exam- 
ple |10-33}to}10-37 
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initialization of variables, pointers, and buffers, ex- 
ample |10-29jto|10-32 


instruction register (IR), definition 


interrupt, definition 
interrupt 1 service routine, exam- 


ple |10-111|to}10-115 
interrupt flag register (IFR) |C-6 
definition 


interrupt mask register (IMR) |C-6 
definition 


interrupt mode (INTM) bit |C-6 

interrupt service routine, definition 
interrupt vector pointer (IPTR), definition 
interrupts 


hardware 
nested 


nonmaskable |C-7 
user-maskable (external) |C-5 


JTAG 
JTAG emulator 
buffered signals [A-10| 
connection to target aa [A-1]to[A-25] 


no signal buffering |A-10 


latency, definition 
least significant bit (LSB), definition 
linker command file for the application, exam- 


ple [10-124}to/10-126 


logical operations |6-43 


low-pass biquad IIR filter, example |10-69]to|10-73 
low-pass filter using MAC instruction, exam- 
ple |10-59}to}10-63 


low-pass symmetric FIR filtering using FIRS instruc- 
tion, example |10-64|to|10-68 


main function call on host side, exam- 


ple |10-118}to}10-120 
main program that calls different functions, exam- 
ple |10-16}to|10-21 
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memory allocation, example [10-10|to[10-15] 
memory map 
memory map of TMS320C541, exam- 

ple [10-12 7fo 10-128 
memory-mapped registers, defined 
micro stack, definition 
microcomputer mode, definition 
microprocessor mode, definition 
microprocessor/microcomputer (MP/MC) bit 
most significant bit (MSB), definition 
multiplication 
multiplier, definition 


multiply two floating-point numbers, exam- 
ple |6-32|to 


nested interrupt 
nomenclature_ |B-6} 
prefixes 


output modes 
external count 
signal event 

overflow, definition 

overflow flag, definition 


PAL 
part numbers, tools 
part-order information 


PCM bit 


pipeline, definition 
pop, definition 
processor initialization, example |10-22}to}|10-24 


processor mode status register (PMST) 
definition 
MP/MC bit 
program address register (PAR), definition 
program controller, cotton a 
program counter, definition 
program counter (PC), definition 
program counter extension (XPC), definition 
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program memory, definition 

program memory address (pma), definition 

program-address generation logic, definition 

program-address generation logic (PAGEN), defini- 
tion 

protocol, bus |A-4 


push, definition 


receive interrupt acl routine, exam- 
ple [10-46fo[10-50 


regional technology centers 

register 
BSP control extension (BSPCE) |C-3 
definition 
host port interface control (HPIC) 
interrupt flag (IFR) |C-6 
interrupt mask (IMR) 

repeat counter, definition 

repeat counter (RC), definition 


reset, definition 


RH bit 
RTCs_ [B-4| 


run/stop operation 


au NB bugger co command [A-20][A-21]|A-22] 


ere input |A-22 


scan path linkers |A-16 
secondary JTAG ei a to an SPL [A-17] 
suggested timings 
usage 

scan paths, TBC emulation connections for JTAG 
scan paths 

seminars |B-4 

serial port interface |C-9 

shifter, definition 

sign control logic, definition 

sign extension, definition 

signal descriptions 14-pin header |A-3 
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Index 


signals 


buffered [A-10] 


buffering for emulator connections [A-10|to[A-13] 
description 14-pin header [A-3] 
timing 
sines and cosines 
single-access RAM (SARAM), definition 
single-instruction repeat loops, memory-to-memory 
block moves 
SINT. See software interrupt 
slave devices 
software development tools 
assembler/linker |B-2 


general 
linker 


software interrupt, definition 
software wait state register (SWWSR), defini- 
tion 
square root computation, example [6-14]to[6-16] 
square roots 
stack, definition 
stack pointer, definition 
stack pointer (SP), definition 
standard serial boot |8-10 
status register 0 (STO), INTM a 
straight, unshrouded = 14-pin 
subtraction 
support tools 
development 
device 
support tools nomenclature, prefixes 
system interface, 541 external interface 


system-integration tools 


target cable |A-14 
target control register (TCR) 
bit summar 
diagram 
target system, connection to emulator [A-1] to[A-25] 
target-system clock 


task scheduling, example to 


TC gna 
A-18)|A-25 
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rosa 
A-18 


TDM serial port control register (TSPC), defini- 
tion 
TDO output 
TDO signal 
temporary register (T), definition 

test bus controller 


test clock |A-12 
diagram |A-12 
third-party support 
time-division multiplexing (TDM), defined |C-10 


timing calculations [A-7|to[A-9} [a-18] to A-26) 


TMS, signal |A-4 
TMS signal_[A-2||A-3 
A-17 


TMS/TDI inputs |A-4 

TMS320C541, interface, external 
tools, partnumbers_ |B-7 

tools nomenclature, prefixes 
transition register, definition 


TRST signal 


unpack 256-point real FFT output, exam- 
ple |10-97|to|10-102 
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unsigned/signed integer division, exam- 
ple to|6-8 


vector table initialization, example |10-6|to 


Viterbi algorithm (channel decoding) {7-5 
Viterbi operator for channel coding, example 


warm boot 
definition 
example [8-12|ta 8-22] 

workshops 


XDS510 emulator, JTAG cable. See emulation 


XF pin 
XF status flag (XF), definition 
XH bit 


zero fill, definition 
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